SpikeGPT: 230M-parameter Spiking Neural Network trained to be a language model

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  • SpikeGPT

    Implementation of "SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks"

  • Thanks for reminding us! We just uploaded the model weights to Hugging Face and updated the README.md of our repo. (PS: this model was only trained on about 900M tokens from BookCorpus.)

  • RWKV-LM

    RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

  • They don't use transformer-style attention because it doesn't work well for SNNs: the additional time domain makes it scale with N^3 instead of N^2. Instead, they use Receptance Weighted Key Value (RWKV), a technique inspired by Apple's Attention Free Transformer and best known from the open-source RWKV-LM model.
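
To make the scaling argument above concrete, here is a minimal NumPy sketch of the RWKV-style "WKV" weighted average, computed two ways: by re-summing over all earlier tokens at every step (quadratic in sequence length) and as a running recurrence (linear). It is an illustration of the published RWKV-4 formula under simplifying assumptions, not code from the SpikeGPT or RWKV-LM repositories. The function names, the per-channel decay `w`, and the current-token bonus `u` are chosen here for illustration, and the numerical stabilization and spiking layers used in practice are omitted.

```python
import numpy as np


def wkv_quadratic(k, v, w, u):
    """Direct form: every step re-sums all earlier tokens, so cost grows as T^2."""
    T, C = k.shape
    out = np.zeros((T, C))
    for t in range(T):
        # The current token gets a separate learned bonus u instead of a decay.
        num = np.exp(u + k[t]) * v[t]
        den = np.exp(u + k[t])
        for i in range(t):
            # Earlier tokens are down-weighted exponentially with their distance.
            weight = np.exp(-(t - 1 - i) * w + k[i])
            num = num + weight * v[i]
            den = den + weight
        out[t] = num / den
    return out


def wkv_recurrent(k, v, w, u):
    """Recurrent form: two running sums carry the whole past, so cost grows as T."""
    T, C = k.shape
    out = np.zeros((T, C))
    a = np.zeros(C)  # decayed running sum of exp(k_i) * v_i over past tokens
    b = np.zeros(C)  # decayed running sum of exp(k_i) over past tokens
    for t in range(T):
        cur = np.exp(u + k[t])
        out[t] = (a + cur * v[t]) / (b + cur)
        # Decay the state by one step, then absorb the current token.
        a = np.exp(-w) * a + np.exp(k[t]) * v[t]
        b = np.exp(-w) * b + np.exp(k[t])
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    T, C = 6, 4                          # toy sequence length and channel count
    k = rng.normal(size=(T, C))
    v = rng.normal(size=(T, C))
    w = rng.uniform(0.1, 1.0, size=C)    # assumed non-negative decay per channel
    u = rng.normal(size=C)
    print(np.allclose(wkv_quadratic(k, v, w, u), wkv_recurrent(k, v, w, u)))
```

Both functions compute the same quantity. The recurrent one only keeps a fixed-size state (`a`, `b`) per channel, which is why this kind of layer can run as an RNN at inference time.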
