Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • mamba-minimal

    Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

  • If a variable contains the batch size, then name it accordingly: batch_size.

    And no glossary needed, KISS

    https://github.com/johnma2006/mamba-minimal/blob/82efa90919c...

  • llm.f90

    LLM inference in Fortran

  • The original Mamba code has a lot of speed optimizations and other details that make it difficult to grasp immediately, so this will help with learning.

    I can't help but also plug my own Mamba inference implementation. https://github.com/rbitr/llm.f90/tree/master/ssm

  • heinsen_sequence

    Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)

  • with only two calls to the PyTorch API. See the examples here:

      https://github.com/glassroom/heinsen_sequence/blob/main/README.md
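
    For context, the paper behind this repository deals with recurrences of the form x_t = a_t * x_{t-1} + b_t, and shows that they can be rewritten as two cumulative (prefix) operations in log space. The sketch below is not the repository's code: it is a pure-Python illustration of that idea, restricted to strictly positive a_t, b_t, and x0 so that real logarithms suffice. The function names are hypothetical. In PyTorch, the two cumulative steps would presumably map to torch.cumsum and torch.logcumsumexp, which is an assumption about what "two calls" refers to.

```python
import math
from itertools import accumulate


def recurrence_sequential(a, b, x0):
    """Reference: x_t = a_t * x_{t-1} + b_t, computed one step at a time."""
    xs, x = [], x0
    for a_t, b_t in zip(a, b):
        x = a_t * x + b_t
        xs.append(x)
    return xs


def logaddexp(u, v):
    """Numerically stable log(exp(u) + exp(v))."""
    hi, lo = max(u, v), min(u, v)
    return hi + math.log1p(math.exp(lo - hi))


def recurrence_via_prefix_sums(a, b, x0):
    """Same sequence, expressed as two cumulative operations in log space.

    Assumes every a_t, b_t, and x0 is strictly positive (illustrative
    restriction so that math.log is defined).
    """
    # Cumulative op 1: a*_t = sum_{k<=t} log a_k
    a_star = list(accumulate(math.log(a_t) for a_t in a))
    # Cumulative op 2: log(x0 + sum_{k<=t} b_k / prod_{j<=k} a_j)
    terms = [math.log(b_t) - s for b_t, s in zip(b, a_star)]
    log_tail = list(accumulate(terms, logaddexp, initial=math.log(x0)))[1:]
    # x_t = exp(a*_t + log_tail_t)
    return [math.exp(s + t) for s, t in zip(a_star, log_tail)]


a = [0.5, 2.0, 1.5]
b = [1.0, 0.25, 3.0]
expected = recurrence_sequential(a, b, x0=2.0)
result = recurrence_via_prefix_sums(a, b, x0=2.0)
```

    Because both steps are prefix operations, each can be evaluated with a parallel scan instead of a sequential loop, which is where the speedup would come from on parallel hardware.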

  • mamba

    The Fast Cross-Platform Package Manager (by mamba-org)

  • >"everyone" seems to know Mamba. I never heard of Mamba

    Only the people who know what Mamba is are upvoting and commenting. Think of all the people who ignore it. For me, Mamba is the faster version of Conda [1], and that's why I clicked on the article.

    https://github.com/mamba-org/mamba

  • ai-notes

    notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

  • the field just moves fast. I have curated a list of non-hypey writers and youtubers who explain these things for a typical SWE audience if you are interested. https://github.com/swyxio/ai-notes/blob/main/Resources/Good%...

  • curated-transformers

    🤖 A PyTorch library of curated Transformer models and their composable components

  • https://github.com/explosion/curated-transformers/blob/main/...

    Llama 1/2:

    https://github.com/explosion/curated-transformers/blob/main/...

    MPT:

    https://github.com/explosion/curated-transformers/blob/main/...

    With various stuff enabled, including support for TorchScript JIT, PyTorch flash attention, etc.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
