-
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
-
SimpleReinforcementLearning
A demonstration of table-based SARSA reinforcement learning for a simple cat/mouse game
-
dqn-atari
A TensorFlow-based implementation of the DeepMind Atari-playing "Deep Q Learning" agent that works reasonably well (by gtoubassi)
This is what RWKV (https://github.com/BlinkDL/RWKV-LM) was made for, and what it will be good at.
Wow. Pretty darn cool! <3 :'))))
+1 you beat me to the punch! I think it's helpful to start with simple RL and ignore the "deep" part to get the basics. The first several lectures in this series do that well. It helped me build a simple "cat and mouse" RL simulation, https://github.com/gtoubassi/SimpleReinforcementLearning, and ultimately a reproduction of the DQN Atari game-playing agent: https://github.com/gtoubassi/dqn-atari.
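For anyone wondering what "simple RL without the deep part" looks like in practice, here is a minimal sketch of tabular SARSA. It is not the code from either linked repo. The grid world, the reward values, the function name `train_sarsa`, and all hyperparameters are assumptions made up for illustration: an agent starts at the top-left of a small grid and learns to reach the bottom-right corner.

```python
import random


def train_sarsa(size=4, episodes=500, alpha=0.5, gamma=0.9,
                epsilon=0.1, max_steps=1000, seed=0):
    """Tabular SARSA on a hypothetical size x size grid.

    Start is (0, 0), goal is (size - 1, size - 1). All numbers here are
    illustrative assumptions, not values taken from the linked projects.
    """
    rng = random.Random(seed)
    moves = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up
    goal = (size - 1, size - 1)

    # The "table": one Q-value per (state, action) pair, initialised to 0.
    q = {((r, c), a): 0.0
         for r in range(size) for c in range(size) for a in range(4)}

    def choose(state):
        # Epsilon-greedy: mostly exploit the table, sometimes explore.
        if rng.random() < epsilon:
            return rng.randrange(4)
        return max(range(4), key=lambda a: q[(state, a)])

    def step(state, action):
        # Moves that would leave the grid keep the agent in place.
        r = min(max(state[0] + moves[action][0], 0), size - 1)
        c = min(max(state[1] + moves[action][1], 0), size - 1)
        nxt = (r, c)
        reward = 1.0 if nxt == goal else -0.01
        return nxt, reward, nxt == goal

    for _ in range(episodes):
        state = (0, 0)
        action = choose(state)
        for _ in range(max_steps):
            nxt, reward, done = step(state, action)
            nxt_action = choose(nxt)
            # SARSA is on-policy: the target uses the action actually
            # chosen for the next state, not the max over actions.
            target = reward if done else reward + gamma * q[(nxt, nxt_action)]
            q[(state, action)] += alpha * (target - q[(state, action)])
            state, action = nxt, nxt_action
            if done:
                break
    return q


if __name__ == "__main__":
    table = train_sarsa()
    best = max(range(4), key=lambda a: table[((0, 0), a)])
    print("greedy action at start:", ["right", "left", "down", "up"][best])
```

The only thing that makes this SARSA rather than Q-learning is the target line: swapping `q[(nxt, nxt_action)]` for the maximum Q-value over all actions in `nxt` would give the off-policy Q-learning update that DQN builds on.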