Frustrated beginner: How to approach/practice implementing papers into code?

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

stable-baselines3

46 7,953 8.2 Python

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

3) You can look at OpenAI baseline or something like this(https://github.com/DLR-RM/stable-baselines3) to make sure the results are reproducible.

Deep-Q-Learning

1 206 0.0 Jupyter Notebook

Tensorflow implementation of Deepminds dqn with double dueling networks

When I started my master thesis last year I was a complete noob in ML, let alone RL. I tried to search for some code I could finally understand and stumbled upon this pretty nice notebook: https://github.com/fg91/Deep-Q-Learning It's by far the best notebook I've worked with yet. Since my goal was to learn PyTorch instead of Tensorflow (used in the notebook, it's also not working properly without tweaks due an old version of TF), I started re-implementing the code in PyTorch. Good thing is that you can compare your own results to the notebook and debug everything with prints if needed. That way I learned a lot about PyTorch and DQN.

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Deep-Reinforcement-Learning-Algorithms-with-PyTorch

2 5,416 3.6 Python

PyTorch implementations of deep reinforcement learning algorithms and environments

This series is actually easier than the David Silver one. Also here is the github repository link - https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project