- **Open-Llama** (Discontinued) — The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
- **open_llama** — OpenLLaMA, a permissively licensed open-source reproduction of Meta AI's LLaMA 7B trained on the RedPajama dataset.
Namespace collisions are inevitable, especially with how fast-moving the LLM space is right now. Just wanted to point out that besides this "Open-Llama" project (which looks really interesting, and well documented in the GitHub repo), there is also another group training "OpenLLaMA": https://github.com/openlm-research/open_llama. That one looks like an effort by two Berkeley PhD students (https://www.haoliu.site/ and http://young-geng.xyz/) to reproduce LLaMA using the 1.2T-token Together RedPajama dataset; they've released checkpoints trained on up to 300B tokens so far.
Feedback for /u/bayes-song: it'd be great to have more info on the model card on HF. Right now it's unclear what the parameter count is, how many total tokens you're planning to train on, and how many you've trained on so far. An Evaluation section (maybe using lm-evaluation-harness) might be good as well?
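For anyone unfamiliar with lm-evaluation-harness, a minimal sketch of what such an evaluation run looks like against a Hugging Face checkpoint is below. The model name and task list are placeholders, and the exact flags depend on which version of the harness is installed:

```shell
# Install EleutherAI's evaluation harness.
pip install lm-eval

# Evaluate a Hugging Face causal LM on a few common benchmarks.
# "your-org/your-model" is a placeholder, not the actual repo name.
lm_eval --model hf \
    --model_args pretrained=your-org/your-model \
    --tasks hellaswag,arc_easy,piqa \
    --batch_size 8 \
    --output_path results.json
```

The resulting accuracy numbers could then be pasted into an Evaluation table on the model card, which makes it much easier to compare against LLaMA and the other reproductions.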