-
SpikeGPT
Implementation of "SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks"
-
RWKV-LM-LoRA
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
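The constant-memory recurrence is what makes RWKV's inference fast and its ctx_len effectively unbounded. As a rough sketch (not the optimized CUDA kernel, which also subtracts a running maximum for numerical stability), the WKV time-mixing step can be written as a per-token update of a numerator/denominator state; the decay `w` and bonus `u` names follow the paper, and this naive version is illustrative only:

```python
import numpy as np

def wkv_recurrence(k, v, w, u):
    """Naive sketch of RWKV's WKV time-mixing recurrence.

    k, v : (T, C) key/value series for T tokens and C channels.
    w, u : (C,)  learned per-channel decay (log-space) and current-token bonus.
    Illustrative only: the real kernel is numerically stabilized.
    """
    T, C = k.shape
    num = np.zeros(C)  # running exp-weighted sum of past values
    den = np.zeros(C)  # running sum of the corresponding weights
    out = np.empty((T, C))
    for t in range(T):
        e_cur = np.exp(u + k[t])                  # current token gets the 'u' bonus
        out[t] = (num + e_cur * v[t]) / (den + e_cur)
        decay = np.exp(-np.exp(w))                # per-channel exponential decay
        num = decay * num + np.exp(k[t]) * v[t]
        den = decay * den + np.exp(k[t])
    return out
```

During training the same quantity can be computed for all tokens in parallel (GPT-style); at inference only `num` and `den` are carried forward, so memory is O(C) regardless of sequence length.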
ChatRWKV v2 now uses a CUDA kernel to optimize INT8 inference (23 tokens/s on a 3090): https://github.com/BlinkDL/ChatRWKV
Soon :) working on it. Meanwhile, take a look at https://github.com/ridgerchu/SpikeGPT, which is an SNN version of RWKV, so its paper has some explanation.
Someone in the RWKV Discord tried it using LoRA (https://github.com/Blealtan/RWKV-LM-LoRA) and the results are quite nice. Join the RWKV Discord for the latest updates :)
Related posts
-
How the RWKV language model works
-
[P] Raven 7B & 14B 🐦(RWKV finetuned on Alpaca+CodeAlpaca+Guanaco) and Gradio Demo for Raven 7B
-
[D] Totally Open Alternatives to ChatGPT
-
[P] RWKV 14B is a strong chatbot despite only trained on Pile (16G VRAM for 14B ctx4096 INT8, more optimizations incoming)
-
[R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python