-
EasyLM
Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
We train the models on cloud TPU-v4s using EasyLM, a JAX-based training pipeline we developed for training and fine-tuning large language models. We employ a combination of normal data parallelism and fully sharded data parallelism (also known as ZeRO stage 3) to balance training throughput and memory usage. Overall, we reach a throughput of over 2100 tokens/second per TPU-v4 chip for our 7B model. The training loss can be seen in the figure below.
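The combination described above can be sketched with JAX's sharding API: a 2D device mesh with one axis for data parallelism and one for parameter sharding (the ZeRO-3-style axis). This is a minimal illustrative sketch, not EasyLM's actual code; the axis names, array shapes, and mesh layout are assumptions, and on a single host the mesh collapses to one device.

```python
import numpy as np
import jax
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Hypothetical 2D mesh: "dp" for data parallelism, "fsdp" for fully
# sharded (ZeRO-3-style) parameter sharding. On a TPU-v4 pod both axes
# would span many chips; here we reshape whatever devices are available.
devices = np.array(jax.devices()).reshape(-1, 1)
mesh = Mesh(devices, axis_names=("dp", "fsdp"))

# Batches are split along "dp": each chip sees a different batch slice.
batch_sharding = NamedSharding(mesh, P("dp", None))
# Parameters are split along "fsdp": each chip holds only a shard of
# every weight matrix, and full weights are gathered on demand.
param_sharding = NamedSharding(mesh, P("fsdp", None))

# Illustrative arrays standing in for a batch and a weight matrix.
batch = jax.device_put(np.ones((8, 16), np.float32), batch_sharding)
params = jax.device_put(np.ones((4, 16), np.float32), param_sharding)
```

Computations staged through `jax.jit` on arrays placed this way are partitioned automatically by the XLA compiler, which is what lets the same training step scale from one chip to a pod while trading memory for cross-chip communication.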
Related posts
-
How To Fine-Tune LLaMA, OpenLLaMA, And XGen, With JAX On A GPU Or A TPU
-
Open-sourced LLMs are adept at mimicking ChatGPT’s style but not its factuality. There remains a substantial capabilities gap, which requires better base LMs.
-
Paid dev gig: develop a basic LLM PEFT finetuning utility
-
Koala: A Dialogue Model for Academic Research [Finetuned Llama-13B on a dataset generated by ChatGPT]
-
Maxtext: A simple, performant and scalable Jax LLM