llm.f90
mamba-minimal
| | llm.f90 | mamba-minimal |
|---|---|---|
| | 13 | 2 |
| Stars | 48 | 2,252 |
| Growth | - | - |
| Activity | 8.4 | 6.6 |
| | about 2 months ago | 2 months ago |
| Language | Fortran | Python |
| License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
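The exact formula behind the activity number is not given here. As a rough sketch of the idea only, the snippet below assumes an exponential decay for commit age (the `half_life_days` parameter is hypothetical) and maps a raw value to a 0-10 score by the share of tracked projects that are less active, so that 9.0 corresponds to the top 10%.

```python
def weighted_activity(commit_ages_days, half_life_days=30.0):
    """Sum of per-commit weights (assumed scheme, not the site's formula).

    A commit made today counts as 1.0; its weight halves every
    `half_life_days`, so recent commits dominate the total.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def relative_score(raw, all_raw):
    """Map a raw activity value to a 0-10 score.

    The score is 10 times the fraction of tracked projects with a strictly
    lower raw value, so 9.0 means 90% of projects are less active.
    """
    below = sum(1 for r in all_raw if r < raw)
    return round(10.0 * below / len(all_raw), 1)


if __name__ == "__main__":
    recent = weighted_activity([1, 2, 3])      # three commits this week
    stale = weighted_activity([300, 310, 320])  # three commits last year
    print(recent > stale)
    print(relative_score(10, list(range(1, 11))))
```

With ten tracked projects whose raw values are all different, the most active one has nine projects below it and scores 9.0 under this mapping.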
llm.f90
- llm.f90: LLM Inference in Fortran
- karpathy/llm.c
I'd like to think he took the name from my llm.f90 project https://github.com/rbitr/llm.f90
It was originally based off of Karpathy's llama2.c but I renamed it when I added support for other architectures.
Probably a coincidence :)
- Winteracter – The Fortran GUI Toolset
I'm a Fortran hobbyist. I'm working (unfortunately less frequently now) on an LLM framework in Fortran: https://github.com/rbitr/llm.f90
- Fortran implementation of phi-2 LLM
- Fortran implementation of phi-2 language model
- TinyLlama: An Open-Source Small Language Model
Also, I should promote the code I wrote for running this. It runs models in ggml format. The one I made available is an older checkpoint, though it's easy to convert the newer one. It's in Fortran, but it should be easy to get gfortran if you don't have it installed.
https://github.com/rbitr/llm.f90/tree/optimize16/purefortran
- Mamba LLM Inference on CPU
- Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch
The original Mamba code has a lot of speed optimizations and other stuff that make it difficult to grasp right away, so this will help with learning.
I can't help but also plug my own Mamba inference implementation. https://github.com/rbitr/llm.f90/tree/master/ssm
- Mamba state-space LLM inference
- Guide to the Mamba architecture that claims to be a replacement for Transformers
You may also be interested in https://github.com/rbitr/llm.f90/tree/master/ssm. It's my inference-only implementation of Mamba, which ends up being much simpler than the training code in the original repo.
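To give a sense of why inference-only code can stay small, here is a minimal sketch of one recurrent step of a discretized state-space model for a single channel with a diagonal state matrix. It is not taken from llm.f90 or mamba-minimal. The function name `ssm_step` is made up for illustration, and `delta_t`, `B_t`, and `C_t` are passed in as plain numbers, whereas a real Mamba layer would compute them from the input.

```python
import math


def ssm_step(h, x_t, delta_t, A, B_t, C_t, D=0.0):
    """One recurrent step for a single channel (illustrative sketch).

    h       : list of n state values from the previous step
    x_t     : scalar input at this step
    delta_t : scalar step size
    A       : list of n diagonal entries of the state matrix
    B_t, C_t: lists of n input and output coefficients
    D       : scalar skip-connection weight

    Update: h_i <- exp(delta_t * A_i) * h_i + delta_t * B_i * x_t
    Output: y_t = sum_i C_i * h_i + D * x_t
    """
    new_h = [
        math.exp(delta_t * a) * h_i + delta_t * b * x_t
        for h_i, a, b in zip(h, A, B_t)
    ]
    y_t = sum(c * h_i for c, h_i in zip(C_t, new_h)) + D * x_t
    return new_h, y_t


if __name__ == "__main__":
    h = [0.0, 0.0]
    for x_t in [1.0, 0.0, 0.0]:
        h, y = ssm_step(h, x_t, 1.0, [-1.0, -2.0], [1.0, 1.0], [1.0, 2.0])
        print(y)
```

Generation only needs this fixed-size state carried from token to token, which is one reason an inference loop can be written without the parallel scan machinery used for training.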
mamba-minimal
- Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch
If a variable contains batch size, then name it accordingly — batch_size.
And no glossary needed, KISS
https://github.com/johnma2006/mamba-minimal/blob/82efa90919c...
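The naming suggestion can be illustrated with a small made-up helper (neither function comes from mamba-minimal). Both versions compute the same thing. The first relies on single-letter names that need a glossary, while the second spells out what each dimension means.

```python
def count_elements_terse(x):
    """Single-letter names: the reader has to look up what b, l, d mean."""
    b, l, d = len(x), len(x[0]), len(x[0][0])
    return b * l * d


def count_elements_descriptive(x):
    """Descriptive names: each dimension is self-explanatory."""
    batch_size = len(x)
    seq_len = len(x[0])
    d_model = len(x[0][0])
    return batch_size * seq_len * d_model


if __name__ == "__main__":
    # A nested list standing in for a (batch, sequence, feature) tensor.
    x = [[[0.0] * 4 for _ in range(3)] for _ in range(2)]
    print(count_elements_terse(x), count_elements_descriptive(x))
```

The trade-off is line length: descriptive names make tensor-heavy expressions longer, which is presumably why some code prefers short names plus a glossary.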
What are some alternatives?
rwkv.f90 - Port of the RWKV-LM model in Fortran (Back to the Future!)
ai-notes - notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.
neural-fortran - A parallel framework for deep learning
curated-transformers - 🤖 A PyTorch library of curated Transformer models and their composable components
inference-engine - A deep learning library for use in high-performance computing applications in modern Fortran
Fortran-code-on-GitHub - Directory of Fortran codes on GitHub, arranged by topic
fastGPT - Fast GPT-2 inference written in Fortran
llvm-project - The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
ClimaCore.jl - CliMA model dycore
mamba - The Fast Cross-Platform Package Manager
fortran-wringer-tests - A collection of non-portable Fortran usage, standard-conformant or otherwise