heinsen_routing
Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All Domains" (Heinsen, 2019), for composing deep neural networks.
-
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.
This looks really interesting! I'm going to take a closer look.
It reminds me of a dynamic routing algorithm (related to self-attention) that can handle sequences with 1M+ tokens: https://github.com/glassroom/heinsen_routing. Right now, you could take 1,000 sequences of hidden states computed by a pretrained transformer, each with, say, 1,024 tokens, concatenate them into a single ultra-long sequence of 1,024,000 hidden states, slap 1,024,000 position encodings on top, and feed the whole thing to that routing algorithm to predict the next token.
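For concreteness, here's a minimal PyTorch sketch of that recipe. The hidden states are random stand-ins for real transformer outputs, and the final routing call is hypothetical, since the actual module name and signature live in the heinsen_routing repo:

    import torch

    n_seqs, seq_len, d_model = 1000, 1024, 768

    # Stand-in for hidden states computed by a pretrained transformer:
    # 1,000 sequences of 1,024 tokens each.
    hidden = torch.randn(n_seqs, seq_len, d_model)

    # Concatenate into a single ultra-long sequence of 1,024,000 states.
    ultra_long = hidden.reshape(1, n_seqs * seq_len, d_model)

    # Slap 1,024,000 position encodings on top (randomly initialized here).
    pos_enc = torch.randn(1, n_seqs * seq_len, d_model) * 0.02
    ultra_long = ultra_long + pos_enc

    # Feed the whole thing to the routing algorithm to predict the next token.
    # Hypothetical call; see https://github.com/glassroom/heinsen_routing for the real API.
    # next_token_logits = routing(ultra_long)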
That line of research is still going: https://github.com/lucidrains/block-recurrent-transformer-py... I think it is worth continuing research on both fronts.
https://github.com/BlinkDL/RWKV-LM claims to work well with long sequences.
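For intuition, here is a rough sketch of the WKV recurrence at the heart of RWKV, based on my reading of the v4 formulation; the repo's real kernels are CUDA with numerical stabilization, so treat this as illustrative only. The per-channel decay w and current-token bonus u are the parameters that replace attention:

    import torch

    def wkv(k, v, w, u):
        # Recurrent form of RWKV's WKV operator (sketch of the v4 formulation).
        # k, v: [T, C] key/value projections; w: [C] positive per-channel decay;
        # u: [C] bonus applied to the current token. O(T) time, O(C) state,
        # which is why RWKV can infer like an RNN over very long sequences.
        T, C = k.shape
        num = torch.zeros(C)   # running decayed sum of exp(k) * v
        den = torch.zeros(C)   # running decayed sum of exp(k)
        out = torch.empty(T, C)
        for t in range(T):
            cur = torch.exp(u + k[t])              # current token gets the bonus
            out[t] = (num + cur * v[t]) / (den + cur)
            decay = torch.exp(-w)
            num = decay * num + torch.exp(k[t]) * v[t]
            den = decay * den + torch.exp(k[t])
        return out

    T, C = 16, 8
    y = wkv(torch.randn(T, C), torch.randn(T, C), torch.rand(C), torch.zeros(C))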
Yea, after all, these LLMs predict one sequence of tokens from another, and the tokens could be anything. It just "happens" that text carries the most knowledge and is the easiest to input; then there are images, sound, and video, but tokens could also be learned from world experience in RL:
Transformers are Sample-Efficient World Models:
https://github.com/eloialonso/iris#transformers-are-sample-e...
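To make the "tokens could be anything" point concrete, here is a toy sketch loosely in the spirit of IRIS: discretize image observations into tokens with a vector-quantized encoder, after which any GPT-style next-token predictor can model them. All names here are hypothetical stand-ins, not the IRIS API:

    import torch
    import torch.nn as nn

    vocab_size, d_model = 512, 256

    # Hypothetical encoder: maps a 64x64 image observation to an 8x8 grid of
    # discrete tokens via nearest-codebook-entry lookup (vector quantization).
    class ObsTokenizer(nn.Module):
        def __init__(self):
            super().__init__()
            self.conv = nn.Conv2d(3, d_model, kernel_size=8, stride=8)
            self.codebook = nn.Embedding(vocab_size, d_model)

        def forward(self, obs):                            # obs: [B, 3, 64, 64]
            z = self.conv(obs).flatten(2).transpose(1, 2)  # [B, 64, d_model]
            # Squared distance from each patch vector to every codebook entry.
            d = (z.unsqueeze(2) - self.codebook.weight).pow(2).sum(-1)
            return d.argmin(-1)                            # [B, 64] token ids

    tokenizer = ObsTokenizer()
    obs = torch.rand(2, 3, 64, 64)   # a batch of "world" observations
    tokens = tokenizer(obs)          # same interface as text tokens:
    # `tokens` can now be fed to any GPT-style model and trained with
    # ordinary next-token prediction, exactly like text.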