[R] RWKV-3: Scaling RNN to 1.5B and Reach Transformer LM Performance (without using attention)

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • RWKV-LM

    RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

  • And RWKV-3 is better. You are welcome to join the project (https://github.com/BlinkDL/RWKV-LM) to build upon it (I am an independent researcher).

  • RWKV-v2-RNN-Pile

    RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.

  • See https://github.com/BlinkDL/RWKV-v2-RNN-Pile for the ppl vs ctxlen curve :)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Ask HN: Freelancer? Seeking freelancer? (May 2024)

    1 project | news.ycombinator.com | 2 May 2024
  • More Low-Bit LLMs

    2 projects | news.ycombinator.com | 2 May 2024
  • Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments

    1 project | news.ycombinator.com | 2 May 2024
  • Create an AI prototyping environment using Jupyter Lab IDE with Typescript, LangChain.js and Ollama for rapid AI prototyping

    4 projects | dev.to | 2 May 2024
  • Show HN: FileKitty – Combine and label text files for LLM prompt contexts

    4 projects | news.ycombinator.com | 1 May 2024