actor-critic algorithms

q-learning-algorithms

1 4 0.0 Python

This repository aims to provide implementations of Q-learning algorithms (DQN, Double-DQN, ...) using PyTorch.

I have learned quite a lot about reinforcement learning over the last few months, and I feel I now understand deep Q-learning algorithms much better (if you want, you can check my [repo](https://github.com/thomashirtz/q-learning-algorithms)). I would now like to shift my focus a little towards actor-critic algorithms. The only problem is that, compared to Q-learning, the papers explain these algorithms less precisely, and explanations on the internet diverge widely (e.g. the original paper does not give A2C, only A3C for one learner).

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.