Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

mamba-minimal

2 2,223 6.6 Python

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

If a variable contains batch size, then name it accordingly — batch_size.
And no glossary needed, KISS
https://github.com/johnma2006/mamba-minimal/blob/82efa90919c...

llm.f90

13 48 8.4 Fortran

LLM inference in Fortran

The original mamba code has a lot of speed optimizations and other stuff that make it difficult to immediately get so this will help with learning.
I can't help but also plug my own Mamba inference implementation. https://github.com/rbitr/llm.f90/tree/master/ssm

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
heinsen_sequence

1 70 8.1

Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)

with only two calls to the PyTorch API. See the examples here:
  https://github.com/glassroom/heinsen_sequence/blob/main/README.md

mamba

34 6,253 9.5 C++

The Fast Cross-Platform Package Manager (by mamba-org)

>"everyone" seems to know Mamba. I never heard of Mamba
Only the "everybody who knows what mamba is" are the ones upvoting and commenting. Think of all the people who ignore it. For me, Mamba is the faster version of Conda [1], and that's why I clicked on the article.
https://github.com/mamba-org/mamba

ai-notes

15 4,554 9.8 HTML

notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

the field just moves fast. I have curated a list of non-hypey writers and youtubers who explain these things for a typical SWE audience if you are interested. https://github.com/swyxio/ai-notes/blob/main/Resources/Good%...

curated-transformers

7 835 9.0 Python

🤖 A PyTorch library of curated Transformer models and their composable components

https://github.com/explosion/curated-transformers/blob/main/...
Llama 1/2:
https://github.com/explosion/curated-transformers/blob/main/...
MPT:
https://github.com/explosion/curated-transformers/blob/main/...
With various stuff enabled, including support for TorchScript JIT, PyTorch flash attention, etc.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

‘Nudify’ Apps That Use AI to ‘Undress’ Women in Photos Are Soaring in Popularity
3 projects | /r/technology | 8 Dec 2023
SDXL Turbo: A Real-Time Text-to-Image Generation Model
4 projects | news.ycombinator.com | 28 Nov 2023
Tools For AI Animation and Filmmaking , Community Rules, ect. (**FAQ**)
20 projects | /r/AI_Film_and_Animation | 5 May 2023
Is anyone interested in this minimalistic interface for ControlNet? The idea is to allow super-simple photo editing without the need to tune hundreds of parameters. Thanks!
2 projects | /r/StableDiffusion | 6 Apr 2023
SD Prompt Generation Script - for instant random, decent quality prompts
3 projects | /r/sdforall | 13 Jan 2023

Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
AI parallel-computing prompt-engineering Pytorch stable-diffusion
Post date: 20 Dec 2023

mamba-minimal

llm.f90

InfluxDB

heinsen_sequence

mamba

ai-notes

curated-transformers

Related posts

Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com AI parallel-computing prompt-engineering Pytorch stable-diffusion Post date: 20 Dec 2023

mamba-minimal

llm.f90

InfluxDB

heinsen_sequence

mamba

ai-notes

curated-transformers

Related posts

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
AI parallel-computing prompt-engineering Pytorch stable-diffusion
Post date: 20 Dec 2023