RWKV-CUDA

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM ) (by BlinkDL)

RWKV-CUDA Alternatives

Similar projects and alternatives to RWKV-CUDA

  1. RWKV-LM

    85 RWKV-CUDA VS RWKV-LM

    RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. ChatRWKV

    28 RWKV-CUDA VS ChatRWKV

    ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

  4. RWKV-v2-RNN-Pile

    RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.

  5. rwkv.cpp

    INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

  6. web-rwkv

    Implementation of the RWKV language model in pure WebGPU/Rust.

  7. AI-Writer

    2 RWKV-CUDA VS AI-Writer

    AI 写小说,生成玄幻和言情网文等等。中文预训练生成模型。采用我的 RWKV 模型,类似 GPT-2 。AI写作。RWKV for Chinese novel generation.

  8. SmallInitEmb

    LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. RWKV-LM-LoRA

    RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

  11. token-shift-gpt

    Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing

  12. RWKV-infctx-trainer

    1 RWKV-CUDA VS RWKV-infctx-trainer

    RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!

  13. ai00_server

    The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better RWKV-CUDA alternative or higher similarity.

RWKV-CUDA discussion

Log in or Post with

RWKV-CUDA reviews and mentions

Posts with mentions or reviews of RWKV-CUDA. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-09.

Stats

Basic RWKV-CUDA repo stats
3
220
2.9
3 months ago

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Cuda is
the 49th most popular programming language
based on number of references?