Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. mamba-minimal

    Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

    If a variable holds the batch size, name it accordingly: batch_size.

    Then no glossary is needed. KISS.

    https://github.com/johnma2006/mamba-minimal/blob/82efa90919c...

  3. llm.f90

    LLM inference in Fortran

    The original Mamba code has a lot of speed optimizations and other details that make it hard to understand at first read, so this will help with learning.

    I can't help but also plug my own Mamba inference implementation. https://github.com/rbitr/llm.f90/tree/master/ssm

  4. heinsen_sequence

    Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)

    with only two calls to the PyTorch API. See the examples here:

      https://github.com/glassroom/heinsen_sequence/blob/main/README.md

  5. mamba

    The Fast Cross-Platform Package Manager (by mamba-org)

    >"everyone" seems to know Mamba. I never heard of Mamba

    Only the people who already know what Mamba is are upvoting and commenting. Think of all the people who ignore it. For me, Mamba is the faster version of Conda [1], and that's why I clicked on the article.

    https://github.com/mamba-org/mamba

  6. ai-notes

    notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

    The field just moves fast. I have curated a list of non-hypey writers and YouTubers who explain these things for a typical SWE audience, if you are interested: https://github.com/swyxio/ai-notes/blob/main/Resources/Good%...

  7. curated-transformers

    🤖 A PyTorch library of curated Transformer models and their composable components

    https://github.com/explosion/curated-transformers/blob/main/...

    Llama 1/2:

    https://github.com/explosion/curated-transformers/blob/main/...

    MPT:

    https://github.com/explosion/curated-transformers/blob/main/...

    With various stuff enabled, including support for TorchScript JIT, PyTorch flash attention, etc.
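
The heinsen_sequence entry above refers to computing a first-order recurrence, x_t = a_t * x_{t-1} + b_t, with two prefix scans instead of a step-by-step loop. Diagonal state-space scans of the kind used in Mamba have the same shape. The sketch below is not taken from that repository: it is a pure-Python illustration, assuming all coefficients, values, and the initial state are positive so that their logarithms are real. The function names are hypothetical, and `itertools.accumulate` stands in for the two scans, which in PyTorch would presumably map to `torch.cumsum` and `torch.logcumsumexp`.

```python
import math
from itertools import accumulate


def logaddexp(u, v):
    """Numerically stable log(exp(u) + exp(v))."""
    hi, lo = max(u, v), min(u, v)
    return hi + math.log1p(math.exp(lo - hi))


def recurrence_sequential(coeffs, values, x0):
    """Reference loop: x_t = a_t * x_{t-1} + b_t, starting from x0."""
    xs, x = [], x0
    for a, b in zip(coeffs, values):
        x = a * x + b
        xs.append(x)
    return xs


def recurrence_via_prefix_scans(coeffs, values, x0):
    """Same recurrence, expressed as two prefix scans in log space.

    Assumption (for this sketch only): every a_t, b_t and x0 is > 0.
    """
    # Scan 1: cumulative sum of log a_t, i.e. log(a_1 * ... * a_t).
    log_coeff_sums = list(accumulate(math.log(a) for a in coeffs))
    # Terms log(b_t / (a_1 * ... * a_t)), with log(x0) prepended.
    shifted = [math.log(x0)] + [
        math.log(b) - s for b, s in zip(values, log_coeff_sums)
    ]
    # Scan 2: cumulative log-sum-exp; drop the leading x0-only entry.
    log_partial = list(accumulate(shifted, logaddexp))[1:]
    # x_t = (a_1 * ... * a_t) * (x0 + sum_k b_k / (a_1 * ... * a_k)).
    return [math.exp(s + p) for s, p in zip(log_coeff_sums, log_partial)]


coeffs = [0.5, 2.0, 0.9]
values = [1.0, 0.3, 2.0]
expected = recurrence_sequential(coeffs, values, x0=1.5)
result = recurrence_via_prefix_scans(coeffs, values, x0=1.5)
```

Both scans use associative operations (addition and log-add-exp), which is what allows a parallel implementation. In plain Python `accumulate` still runs sequentially, so this only demonstrates that the two formulations agree.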
