Frustrated beginner: How to approach/practice implementing papers into code?

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

  • 3) You can look at OpenAI baseline or something like this(https://github.com/DLR-RM/stable-baselines3) to make sure the results are reproducible.

  • Deep-Q-Learning

    Tensorflow implementation of Deepminds dqn with double dueling networks

  • When I started my master thesis last year I was a complete noob in ML, let alone RL. I tried to search for some code I could finally understand and stumbled upon this pretty nice notebook: https://github.com/fg91/Deep-Q-Learning It's by far the best notebook I've worked with yet. Since my goal was to learn PyTorch instead of Tensorflow (used in the notebook, it's also not working properly without tweaks due an old version of TF), I started re-implementing the code in PyTorch. Good thing is that you can compare your own results to the notebook and debug everything with prints if needed. That way I learned a lot about PyTorch and DQN.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • Deep-Reinforcement-Learning-Algorithms-with-PyTorch

    PyTorch implementations of deep reinforcement learning algorithms and environments

  • This series is actually easier than the David Silver one. Also here is the github repository link - https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts