-
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Local-LLM-Comparison-Colab-UI
Compare the performance of different LLM that can be deployed locally on consumer hardware. Run yourself with Colab WebUI.
Alpaca Eval is open source and was developed by the same team who trained the alpaca model afaik. It is not like what you said in the other comment
If you want to try it out, you can use Google Colab here with Oobabooga Text Generation UI: Link (Remember to check the instruction template and generation parameters)
Related posts
-
[P] AlpacaEval : An Automatic Evaluator for Instruction-following Language Models
-
UltraChat's License is now MIT
-
Looks like there is a new model UltraLM that topped the AlpacaEval Leaderboard
-
Show HN: Times faster LLM evaluation with Bayesian optimization
-
Show HN: Mistral LLM w Assistants API and Action tool 4 autonomous requests