open_llama
Open-Llama
Our great sponsors
open_llama | Open-Llama | |
---|---|---|
52 | 7 | |
7,193 | 637 | |
1.3% | - | |
5.3 | 10.0 | |
10 months ago | 12 months ago | |
Python | ||
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
open_llama
-
How Open is Generative AI? Part 2
The RedPajama dataset was adapted by the OpenLLaMA project at UC Berkeley, creating an open-source LLaMA equivalent without Metaโs restrictions. The model's later version also included data from Falcon and StarCoder. This highlights the importance of open-source models and datasets, enabling free repurposing and innovation.
-
GPT-4 API general availability
OpenLLaMA is though. https://github.com/openlm-research/open_llama
All of these are surmountable problems.
We can beat OpenAI.
We can drain their moat.
-
Recommend me a computer for local a.i for 500 $
#1: ๐ Open-source Reproduction of Meta AIโs LLaMA OpenLLaMA-13B released. (trained for 1T tokens) | 0 comments #2: ๐ #1 on HuggingFace.co's Leaderboard Model Falcon 40B is now Free (Apache 2.0 License) | 0 comments #3: ๐ Have you seen this repo? "running LLMs on consumer-grade hardware. compatible models: llama.cpp, alpaca.cpp, gpt4all.cpp, rwkv.cpp, whisper.cpp, vicuna, koala, gpt4all-j, cerebras and many others!" | 0 comments
-
Who is openllama from?
Trained OpenLLaMA models are from the OpenLM Research team in collaboration with Stability AI: https://github.com/openlm-research/open_llama
-
Personal GPT: A tiny AI Chatbot that runs fully offline on your iPhone
I can't use Llama or any model from the Llama family, due to license restrictions. Although now there's also the OpenLlama family of models, which have the same architecture but were trained on an open dataset (RedPajama, the same dataset the base model in my app was trained on). I'd love to pursue the direction of extended context lengths for on-device LLMs. Likely in a month or so, when I've implemented all the product feature that I currently have on my backlog.
-
XGen-7B, a new 7B foundational model trained on up to 8K length for 1.5T tokens
https://github.com/openlm-research/open_llama#update-0615202...).
XGen-7B is probably the superior 7B model, it's trained on more tokens and a longer default sequence length (although both presumably can adopt SuperHOT (Position Interpolation) to extend context), but larger models still probably perform better on an absolute basis.
-
MosaicML Agrees to Join Databricks to Power Generative AI for All
Compare it to openllama. It github doesn't have a single script on how to do anything.
-
Databricks Strikes $1.3B Deal for Generative AI Startup MosaicML
OpenLLaMA models up to 13B parameters have now been trained on 1T tokens:
https://github.com/openlm-research/open_llama
-
Containerized AI before Apocalypse ๐ณ๐ค
The deployed LLM binary, orca mini, has 3 billion parameters. Orca mini is based on the OpenLLaMA project.
-
AI โ weekly megathread!
OpenLM Research released its 1T token version of OpenLLaMA 13B - the permissively licensed open source reproduction of Meta AI's LLaMA large language model. [Details].
Open-Llama
-
(1/2) May 2023
Training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF (https://github.com/s-JoL/Open-Llama)
- Open-Lamam: A โrealโ open-source project to train LLM not just checkpoints
- Open-Lamam: A real open-source project to train LLM
- Open-Llama: A Open Source Project for Training Language Models
-
OpenLLaMA: An Open Reproduction of LLaMA
Really exciting how fast fully pre-trained new models are appearing.
Here's another repo (with the same "open-llama" name) that has been available on hugging face as well for a few weeks. (different training dataset)
https://github.com/s-JoL/Open-Llama
-
Build your onw LLM 101
Open-Llama
- Open-Llama is an open source project that provides a complete set of training processes for building large-scale language models, from data preparation to tokenization, pre-training, instruction tuning, and reinforcement learning techniques such as RLHF.
What are some alternatives?
FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
My-Medium-Articles-Friendly-Links - Friendly link to all of my medium articles
llama.cpp - LLM inference in C/C++
AgileRL - Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.
RWKV-LM - RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
gpt4all - gpt4all: run open-source LLMs anywhere
promptfoo - Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
gorilla - Gorilla: An API store for LLMs
EasyLM - Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
ggml - Tensor library for machine learning
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.