UltraLM-13B reaches top of AlpacaEval leaderboard

UltraChat

Mentions: 3 | Stars: 2,108 | Activity: 6.7 | Language: Python

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

alpaca_eval

Mentions: 4 | Stars: 1,103 | Activity: 9.6 | Language: Jupyter Notebook

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

AlpacaEval is open source and, afaik, was developed by the same team that trained the Alpaca model. It is not what you described in the other comment.

Local-LLM-Comparison-Colab-UI

Mentions: 20 | Stars: 876 | Activity: 9.1 | Language: Jupyter Notebook

Compare the performance of different LLMs that can be deployed locally on consumer hardware. Run it yourself with the Colab WebUI.

If you want to try it out, you can run it in Google Colab with the Oobabooga Text Generation UI: Link (remember to check the instruction template and generation parameters).

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.

Related posts

[P] AlpacaEval: An Automatic Evaluator for Instruction-following Language Models
2 projects | /r/LocalLLaMA | 8 Jun 2023

UltraChat's License is now MIT
1 project | news.ycombinator.com | 11 Oct 2023

Looks like there is a new model UltraLM that topped the AlpacaEval Leaderboard
1 project | /r/LocalLLaMA | 29 Jun 2023

Show HN: Times faster LLM evaluation with Bayesian optimization
6 projects | news.ycombinator.com | 13 Feb 2024

Show HN: Mistral LLM w Assistants API and Action tool 4 autonomous requests
1 project | news.ycombinator.com | 13 Feb 2024

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA.
Tags: Deep Learning, large-language-models, Evaluation, Chatbot, foundation-models
Post date: 28 Jun 2023
