[P] AlpacaEval : An Automatic Evaluator for Instruction-following Language Models

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

alpaca_eval

4 1,103 9.6 Jupyter Notebook

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

I have been going deep in this space for my can-ai-code project and was looking at the config that WizardLM was run with: https://github.com/tatsu-lab/alpaca_eval/blob/main/src/alpaca_eval/models_configs/wizardlm-13b/configs.yaml

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

UltraLM-13B reaches top of AlpacaEval leaderboard

3 projects | /r/LocalLLaMA | 28 Jun 2023
Show HN: Times faster LLM evaluation with Bayesian optimization

6 projects | news.ycombinator.com | 13 Feb 2024
Show HN: Mistral LLM w Assistants API and Action tool 4 autonomous requests

1 project | news.ycombinator.com | 13 Feb 2024
Ask HN: AI/ML papers to catch up with current state of AI?

3 projects | news.ycombinator.com | 15 Dec 2023
Open Source Function Calling with Intel's LLM in Javascript

1 project | /r/LocalLLaMA | 8 Dec 2023

[P] AlpacaEval : An Automatic Evaluator for Instruction-following Language Models

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA
Deep Learning Evaluation foundation-models instruction-following large-language-models
Post date: 8 Jun 2023

alpaca_eval

InfluxDB

Related posts

UltraLM-13B reaches top of AlpacaEval leaderboard

Show HN: Times faster LLM evaluation with Bayesian optimization

Show HN: Mistral LLM w Assistants API and Action tool 4 autonomous requests

Ask HN: AI/ML papers to catch up with current state of AI?

Open Source Function Calling with Intel's LLM in Javascript

[P] AlpacaEval : An Automatic Evaluator for Instruction-following Language Models

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA Deep Learning Evaluation foundation-models instruction-following large-language-models Post date: 8 Jun 2023

alpaca_eval

InfluxDB

Related posts

UltraLM-13B reaches top of AlpacaEval leaderboard

Show HN: Times faster LLM evaluation with Bayesian optimization

Show HN: Mistral LLM w Assistants API and Action tool 4 autonomous requests

Ask HN: AI/ML papers to catch up with current state of AI?

Open Source Function Calling with Intel's LLM in Javascript

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA
Deep Learning Evaluation foundation-models instruction-following large-language-models
Post date: 8 Jun 2023