Mamba-Chat: A Chat LLM based on State Space Models

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA.

  • mamba

  • You might have come across the Mamba paper in the last few days; it was the first attempt at scaling state space models up to 2.8B parameters and applying them to language data.
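
    For a concrete sense of what the mamba package provides, here is a minimal sketch of the standalone Mamba block from mamba_ssm applied to a batch of sequences. The d_model/d_state/d_conv/expand values are illustrative, and the selective-scan kernels assume a CUDA GPU.

    ```python
    import torch
    from mamba_ssm import Mamba

    batch, length, dim = 2, 64, 256
    x = torch.randn(batch, length, dim, device="cuda")

    block = Mamba(
        d_model=dim,  # model width
        d_state=16,   # SSM state dimension
        d_conv=4,     # local (causal) convolution width
        expand=2,     # block expansion factor
    ).to("cuda")

    y = block(x)  # same shape in and out: (batch, length, dim)
    assert y.shape == x.shape
    ```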

  • mamba-chat

    Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

  • Feel free to check out our GitHub or Hugging Face repository! Our GitHub repo includes a CLI chat script, so you can easily run the model if you have access to a GPU.
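
    If you would rather load the model directly instead of using the repo's chat script, a minimal sketch is shown below. It assumes the checkpoint is published as havenhq/mamba-chat on Hugging Face, that the tokenizer ships a chat template, and that a CUDA GPU is available; the loading signature and generation arguments are illustrative rather than the repo's exact settings.

    ```python
    import torch
    from transformers import AutoTokenizer
    from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel

    device = "cuda"
    tokenizer = AutoTokenizer.from_pretrained("havenhq/mamba-chat")
    model = MambaLMHeadModel.from_pretrained(
        "havenhq/mamba-chat", device=device, dtype=torch.float16
    )

    # Build a single-turn chat prompt with the tokenizer's chat template
    # (assumes the tokenizer defines one).
    messages = [{"role": "user", "content": "What is a state space model?"}]
    input_ids = tokenizer.apply_chat_template(
        messages, return_tensors="pt", add_generation_prompt=True
    ).to(device)

    # mamba-ssm's generate() takes max_length (prompt + new tokens);
    # the sampling settings here are assumptions, not the repo's defaults.
    out = model.generate(
        input_ids=input_ids,
        max_length=input_ids.shape[1] + 256,
        temperature=0.7,
        top_k=50,
        top_p=0.9,
    )
    print(tokenizer.decode(out[0][input_ids.shape[1]:], skip_special_tokens=True))
    ```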

  • text-generation-webui

    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models. (by trap20)

  • Seems to run in my hacked-together text-generation-webui branch for mamba-ssm: https://github.com/trap20/text-generation-webui/tree/mamba-ssm

  • llama.cpp

    LLM inference in C/C++

  • onnxruntime

    ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

  • causal-conv1d

    Causal depthwise conv1d in CUDA, with a PyTorch interface
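
    To make the operation concrete, here is a pure-PyTorch reference for what a causal depthwise conv1d computes: one filter per channel (groups == channels), with left padding of width - 1 so each output position only sees current and past inputs. This is a slow stand-in for the CUDA kernel, and the (batch, dim, seqlen) / (dim, width) shape convention is assumed to match the package.

    ```python
    import torch
    import torch.nn.functional as F

    def causal_depthwise_conv1d_ref(x, weight, bias=None):
        """x: (batch, dim, seqlen); weight: (dim, width); bias: (dim,) or None."""
        dim, width = weight.shape
        x = F.pad(x, (width - 1, 0))              # pad only on the left => causal
        return F.conv1d(x, weight.unsqueeze(1),   # (dim, 1, width) depthwise filters
                        bias=bias, groups=dim)

    batch, dim, seqlen, width = 2, 8, 16, 4
    x = torch.randn(batch, dim, seqlen)
    w = torch.randn(dim, width)
    b = torch.randn(dim)
    y = causal_depthwise_conv1d_ref(x, w, b)
    print(y.shape)  # torch.Size([2, 8, 16])
    ```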

NOTE: The number of mentions on this list reflects mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.
