Eval mmlu result against various infer methods (HF_Causal, VLLM, AutoGPTQ, AutoGPTQ-exllama)

instruct-eval

6 471 8.0 Python

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

I modified declare-lab's instruct-eval scripts, added support for vLLM and AutoGPTQ (the new AutoGPTQ now supports exllama), and tested the MMLU results. I also added support for fastllm (which can accelerate ChatGLM2-6b). The code is here: https://github.com/declare-lab/instruct-eval . I'd like to hear about any errors in the code.
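For readers who want to see the shape of such a comparison, here is a minimal sketch of scoring MMLU-style multiple-choice items against several interchangeable inference backends. It is not the code from the repository: the prompt format, the `Item` layout, the function names, and the stub backends are all assumptions made for illustration. In a real run, each stub would be replaced by a function that sends the prompt to HF_Causal, vLLM, AutoGPTQ, and so on, and returns the generated text.

```python
from typing import Callable, Dict, List, Tuple

CHOICES = ["A", "B", "C", "D"]

# Hypothetical item layout: (question, four options, gold answer letter).
Item = Tuple[str, List[str], str]


def format_prompt(question: str, options: List[str]) -> str:
    """Render a zero-shot multiple-choice prompt that ends with 'Answer:'."""
    lines = [question]
    lines += [f"{letter}. {text}" for letter, text in zip(CHOICES, options)]
    lines.append("Answer:")
    return "\n".join(lines)


def accuracy(predict: Callable[[str], str], items: List[Item]) -> float:
    """Fraction of items whose first output character matches the gold letter."""
    if not items:
        return 0.0
    correct = 0
    for question, options, gold in items:
        output = predict(format_prompt(question, options)).strip()
        correct += output[:1].upper() == gold
    return correct / len(items)


def compare(backends: Dict[str, Callable[[str], str]],
            items: List[Item]) -> Dict[str, float]:
    """Score every backend on the same items so the numbers are comparable."""
    return {name: accuracy(fn, items) for name, fn in backends.items()}


# Toy data and stub backends (assumptions, standing in for real model calls).
ITEMS: List[Item] = [
    ("What is 2 + 2?", ["3", "4", "5", "6"], "B"),
    ("Which planet is closest to the Sun?",
     ["Mercury", "Venus", "Earth", "Mars"], "A"),
]


def always_a(prompt: str) -> str:
    return "A"


def oracle(prompt: str) -> str:
    # Returns untidy output on purpose to exercise the normalization step.
    return "B" if "2 + 2" in prompt else " a"


RESULTS = compare({"stub_always_a": always_a, "stub_oracle": oracle}, ITEMS)
print(RESULTS)
```

Feeding every backend the exact same prompts and applying the same answer extraction keeps differences in the scores attributable to the inference method (for example, quantization) rather than to the harness.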

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

[D] Red Pajamas Instruct 7B. Is it really that bad or some ggml/quantization artifact? Vicuna-7b has no issue writing stories and even does basic text transformation. Yet RP refuses to do anything most of the time. It does generate a story if you run it as a raw model, but gets into a loop.

1 project | /r/MachineLearning | 27 May 2023
[P] The first RedPajama models are here! The 3B and 7B models are now available under Apache 2.0, including instruction-tuned and chat versions. These models aim to replicate LLaMA as closely as possible.

1 project | /r/MachineLearning | 6 May 2023
Best Instruct-Trained Alternative to Alpaca/Vicuna?

2 projects | /r/LanguageTechnology | 23 Apr 2023
[R] Comprehensive List of Instruction Datasets for Training LLM Models (GPT-4 & Beyond)

2 projects | /r/MachineLearning | 21 Apr 2023
Ask HN: Which LLMs can run locally on most consumer computers

2 projects | news.ycombinator.com | 21 May 2024

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA
instruct-tuning llm
Post date: 8 Sep 2023
