open_llama vs modal-examples

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset (by openlm-research)

Suggest topics

Source Code

Suggest alternative

Edit details

modal-examples

Examples of programs built using Modal (by modal-labs)

Cloud Machine Learning Modal Python Serverless Distributed GPU Pytorch stable-diffusion Web

Source Code

modal.com

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

open_llama		modal-examples
	Project
52	Mentions	9
7,193	Stars	555
1.3%	Growth	14.8%
5.3	Activity	9.5
10 months ago	Latest Commit	9 days ago
	Language	Python
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

open_llama

Posts with mentions or reviews of open_llama. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-19.

How Open is Generative AI? Part 2
8 projects | dev.to | 19 Dec 2023

The RedPajama dataset was adapted by the OpenLLaMA project at UC Berkeley, creating an open-source LLaMA equivalent without Meta’s restrictions. The model's later version also included data from Falcon and StarCoder. This highlights the importance of open-source models and datasets, enabling free repurposing and innovation.
GPT-4 API general availability
15 projects | news.ycombinator.com | 6 Jul 2023

OpenLLaMA is though. https://github.com/openlm-research/open_llama
All of these are surmountable problems.
We can beat OpenAI.
We can drain their moat.
Recommend me a computer for local a.i for 500 $
2 projects | /r/ArtificialInteligence | 1 Jul 2023

#1: 🌞 Open-source Reproduction of Meta AI’s LLaMA OpenLLaMA-13B released. (trained for 1T tokens) | 0 comments #2: 🎉 #1 on HuggingFace.co's Leaderboard Model Falcon 40B is now Free (Apache 2.0 License) | 0 comments #3: 😍 Have you seen this repo? "running LLMs on consumer-grade hardware. compatible models: llama.cpp, alpaca.cpp, gpt4all.cpp, rwkv.cpp, whisper.cpp, vicuna, koala, gpt4all-j, cerebras and many others!" | 0 comments
Who is openllama from?
1 project | /r/LocalLLaMA | 30 Jun 2023

Trained OpenLLaMA models are from the OpenLM Research team in collaboration with Stability AI: https://github.com/openlm-research/open_llama
Personal GPT: A tiny AI Chatbot that runs fully offline on your iPhone
14 projects | /r/ChatGPT | 30 Jun 2023

I can't use Llama or any model from the Llama family, due to license restrictions. Although now there's also the OpenLlama family of models, which have the same architecture but were trained on an open dataset (RedPajama, the same dataset the base model in my app was trained on). I'd love to pursue the direction of extended context lengths for on-device LLMs. Likely in a month or so, when I've implemented all the product feature that I currently have on my backlog.
XGen-7B, a new 7B foundational model trained on up to 8K length for 1.5T tokens
3 projects | news.ycombinator.com | 28 Jun 2023

https://github.com/openlm-research/open_llama#update-0615202...).
XGen-7B is probably the superior 7B model, it's trained on more tokens and a longer default sequence length (although both presumably can adopt SuperHOT (Position Interpolation) to extend context), but larger models still probably perform better on an absolute basis.
MosaicML Agrees to Join Databricks to Power Generative AI for All
3 projects | /r/LocalLLaMA | 26 Jun 2023

Compare it to openllama. It github doesn't have a single script on how to do anything.
Databricks Strikes $1.3B Deal for Generative AI Startup MosaicML
4 projects | news.ycombinator.com | 26 Jun 2023

OpenLLaMA models up to 13B parameters have now been trained on 1T tokens:
https://github.com/openlm-research/open_llama
Containerized AI before Apocalypse 🐳🤖
4 projects | dev.to | 25 Jun 2023

The deployed LLM binary, orca mini, has 3 billion parameters. Orca mini is based on the OpenLLaMA project.
AI — weekly megathread!
2 projects | /r/artificial | 23 Jun 2023

OpenLM Research released its 1T token version of OpenLLaMA 13B - the permissively licensed open source reproduction of Meta AI's LLaMA large language model. [Details].

modal-examples

Posts with mentions or reviews of modal-examples. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-16.

Show HN: Real-time image autocomplete in <100 lines of code with SDXL Lightning
1 project | news.ycombinator.com | 23 Feb 2024

We made a small app for SDXL Lightning, running your own Python code on GPUs. It generates images in real time.
https://potatoes.ai/
We know there was a fal.ai post yesterday, and that got a lot of interest, but we also made this demo yesterday and didn't share — just wanted to mention it as an alternative option for people who like running their own code and custom models instead of using a prebuilt API provider.
The backend code is open-source too and you can deploy it yourself: https://github.com/modal-labs/modal-examples/blob/main/06_gpu_and_ml/stable_diffusion/stable_diffusion_xl_lightning.py
Our startup has docs issues and it is costing us prospects. What things can you share to help us?
3 projects | /r/ExperiencedDevs | 16 May 2023

The startup I work at is relatively pretty good at documentation engineering. We have written code to test the code snippets in docstrings (https://github.com/modal-labs/pytest-markdown-docs) and we have written code to do synthetic monitoring testing of the examples in our examples repo (https://github.com/modal-labs/modal-examples). We are also diligent about putting using Python's warnings library to handle API deprecation, and treat deprecation warnings as errors internally, ensuring our own code samples and examples are most up-to-date.
OpenLLaMA: An Open Reproduction of LLaMA
14 projects | news.ycombinator.com | 2 May 2023

You can get it running with one Python script on Modal.com :)
https://github.com/modal-labs/modal-examples/blob/main/06_gp...
Whispers AI Modular Future
14 projects | news.ycombinator.com | 20 Feb 2023

This demo lets you choose the podcast, and is open-source: https://modal-labs--whisper-pod-transcriber-fastapi-app.moda...
https://github.com/modal-labs/modal-examples/tree/main/06_gp...
Transcribes 1hr of audio in roughly 1min, using parallelisation across CPUs.
Show HN: PodText.ai – Search anything said on a podcast, Highlight text to play
4 projects | news.ycombinator.com | 9 Feb 2023

This demo is open-source: https://github.com/modal-labs/modal-examples/tree/main/06_gp....
https://modal-labs--whisper-pod-transcriber-fastapi-app.moda...
Show HN: Stable Diffusion Pokémon Cards
1 project | news.ycombinator.com | 14 Jan 2023

It's become so easy to stick together ML models, often without training most or all of them yourself.
*video demo:* https://youtu.be/mQsMuM8d4Qc
*cloud platform:* https://modal.com
*code*: https://github.com/modal-labs/modal-examples/tree/main/06_gp...
How can machine learning help us learn languages better?
1 project | /r/languagelearning | 7 Nov 2022

Transcription - OpenAI just released Whisper. Check out what it can do with podcasts
[P] Transcribe any podcast episode in just 1 minute with optimized OpenAI/whisper
4 projects | /r/MachineLearning | 6 Nov 2022

Here's the source code.

What are some alternatives?

When comparing open_llama and modal-examples you can also consider the following projects:

FastChat - An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

llama.cpp - LLM inference in C/C++

FlexGen - Running large language models on a single GPU for throughput-oriented scenarios.

RWKV-LM - RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

WAAS - Whisper as a Service (GUI and API with queuing for OpenAI Whisper)

gpt4all - gpt4all: run open-source LLMs anywhere

EasyLM - Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

gorilla - Gorilla: An API store for LLMs

mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

ggml - Tensor library for machine learning

brev-cli - Connect your laptop to cloud computers. Follow to stay updated about our product