smolrsrwkv
A relatively basic implementation of RWKV in Rust, written by someone with very little math and ML knowledge. It supports 32-, 8-, and 4-bit evaluation, and can also directly load PyTorch RWKV models.
It can now quantize to 8-bit for roughly 4x memory savings: https://github.com/KerfuffleV2/smolrsrwkv/tree/experiment-quantize
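The 4x saving comes from storing each f32 weight as a single byte plus a shared scale. A minimal sketch of absmax 8-bit quantization in Rust (the function names and single-scale scheme here are illustrative, not smolrsrwkv's actual API):

```rust
/// Quantize a slice of f32 values to u8 with one absmax-derived scale.
/// Each value maps to 0..=255 centered at 127, so storage drops from
/// 4 bytes to 1 byte per weight (plus one f32 scale per slice).
fn quantize_8bit(weights: &[f32]) -> (Vec<u8>, f32) {
    let absmax = weights.iter().fold(0.0f32, |m, &w| m.max(w.abs()));
    let scale = if absmax == 0.0 { 1.0 } else { absmax / 127.0 };
    let q = weights
        .iter()
        .map(|&w| ((w / scale).round() as i32 + 127).clamp(0, 255) as u8)
        .collect();
    (q, scale)
}

/// Recover approximate f32 values from the quantized bytes.
fn dequantize_8bit(q: &[u8], scale: f32) -> Vec<f32> {
    q.iter().map(|&b| (b as i32 - 127) as f32 * scale).collect()
}

fn main() {
    let w = vec![-1.5f32, 0.0, 0.75, 1.5];
    let (q, scale) = quantize_8bit(&w);
    let back = dequantize_8bit(&q, scale);
    // Round-trip error is bounded by half a quantization step.
    for (a, b) in w.iter().zip(back.iter()) {
        assert!((a - b).abs() <= scale / 2.0 + 1e-6);
    }
    println!("round-trip ok, scale = {scale}");
}
```

In practice per-row or per-block scales give better accuracy than one scale for a whole tensor, at a small storage cost.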
This is a simple Rust implementation of RWKV. Most LLMs (e.g. ChatGPT) use Transformers; the creator of the RWKV approach claims it has benefits over them: https://github.com/BlinkDL/ChatRWKV
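One claimed benefit is that RWKV replaces attention over all past tokens with a small recurrent state, so inference memory stays constant in sequence length. A simplified per-channel sketch of the idea in Rust (this omits RWKV's current-token bonus term and log-space numerical stabilization, and is not smolrsrwkv's actual code):

```rust
// Simplified RWKV-style time mixing for one channel. Two accumulators
// carry the whole history:
//   num_t = e^{-w} * num_{t-1} + e^{k_t} * v_t
//   den_t = e^{-w} * den_{t-1} + e^{k_t}
// and the output at step t is num_t / den_t: a decayed, key-weighted
// average of past values, updated in O(1) per token.
struct WkvState {
    num: f32, // decayed sum of e^{k_i} * v_i
    den: f32, // decayed sum of e^{k_i}
}

impl WkvState {
    fn new() -> Self {
        WkvState { num: 0.0, den: 0.0 }
    }

    /// Feed one (key, value) pair with decay w; returns the mixed output.
    fn step(&mut self, w: f32, k: f32, v: f32) -> f32 {
        let decay = (-w).exp();
        let ek = k.exp();
        self.num = decay * self.num + ek * v;
        self.den = decay * self.den + ek;
        self.num / (self.den + 1e-9)
    }
}

fn main() {
    let mut state = WkvState::new();
    // With k = 0 everywhere, the output is a decay-weighted average of the v's.
    for v in [1.0f32, 2.0, 3.0] {
        let out = state.step(0.5, 0.0, v);
        println!("{out}");
    }
}
```

The real model learns w, k, and v per channel and adds a special weight for the current token; the structural point is that nothing past the two accumulators needs to be kept.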
Related posts
- How the RWKV language model works
- KoboldCpp - Combining all the various ggml.cpp CPU LLM inference projects with a WebUI and API (formerly llamacpp-for-kobold)
- [P] Raven 7B & 14B 🐦(RWKV finetuned on Alpaca+CodeAlpaca+Guanaco) and Gradio Demo for Raven 7B
- [D] Totally Open Alternatives to ChatGPT
- [R] RWKV 14B ctx8192 is a zero-shot instruction-follower without finetuning, 23 token/s on 3090 after latest optimization (16G VRAM is enough, and you can stream layers to save more VRAM)