Q-Transformer: Scalable Reinforcement Learning via Autoregressive Q-Functions

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers
Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
www.nutrient.io
featured
  1. RWKV-LM

    RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

    This is what RWKV (https://github.com/BlinkDL/RWKV-LM) was made for, and what it will be good at.

    Wow. Pretty darn cool! <3 :'))))

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. SimpleReinforcementLearning

    A demonstration of table based, SARSA reinforcement learning for a simple cat/mouse game

    +1 you beat me to the punch! I think its helpful to start with simple RL and ignore the "deep" part to get the basics. The first several lectures in this series do that well. It helped me build a simple "cat and mouse" RL simulation https://github.com/gtoubassi/SimpleReinforcementLearning and ultimately a reproduction of the DQN atari game playing agent: https://github.com/gtoubassi/dqn-atari.

  4. dqn-atari

    A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well (by gtoubassi)

    +1 you beat me to the punch! I think its helpful to start with simple RL and ignore the "deep" part to get the basics. The first several lectures in this series do that well. It helped me build a simple "cat and mouse" RL simulation https://github.com/gtoubassi/SimpleReinforcementLearning and ultimately a reproduction of the DQN atari game playing agent: https://github.com/gtoubassi/dqn-atari.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Ask HN: Is anybody building an alternative transformer?

    3 projects | news.ycombinator.com | 14 Feb 2025
  • Do LLMs need a context window?

    1 project | news.ycombinator.com | 25 Dec 2023
  • Paving the way to efficient architectures: StripedHyena-7B

    1 project | news.ycombinator.com | 8 Dec 2023
  • Understanding Deep Learning

    1 project | news.ycombinator.com | 26 Nov 2023
  • "If you see a startup claiming to possess top-secret results leading to human level AI, they're lying or delusional. Don't believe them!" - Yann LeCun, on the conspiracy theories of "X company has reached AGI in secret"

    1 project | /r/singularity | 26 Jun 2023

Did you know that Python is
the 2nd most popular programming language
based on number of references?