stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Ah, sorry, I misunderstood your post. Coding quite a few of them from scratch has helped me, but you can also check out https://github.com/openai/baselines or similar.
I would recommend looking at Grokking Deep RL if you want some hands-on DRL practice in Python without starting completely from scratch. You can find some of the Jupyter notebooks here.
If you want to iterate quickly through different RL methods, it's a good idea to use one of the RL libraries, like Stable Baselines3. Then you can dig further into the methods that work best for you. Coding RL methods from scratch is very time-consuming and error-prone, even for experienced programmers.
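To give a rough idea of what that quick iteration can look like, here is a minimal sketch using Stable Baselines3. The environment (`CartPole-v1`), the three algorithms, the timestep budget, and the helper name `train_and_evaluate` are my own assumptions for illustration, not something from the thread. It also assumes `stable-baselines3` and `gymnasium` are installed.

```python
# Algorithms to compare; all three work with CartPole's discrete actions.
# (Illustrative choice, not a recommendation.)
ALGORITHMS = ("PPO", "A2C", "DQN")


def train_and_evaluate(algo_name, env_id="CartPole-v1", total_timesteps=10_000):
    """Train one SB3 algorithm briefly and return (mean_reward, std_reward)."""
    # Imported here so the file still loads if SB3 is not installed.
    import stable_baselines3 as sb3
    from stable_baselines3.common.evaluation import evaluate_policy

    algo_cls = getattr(sb3, algo_name)  # e.g. sb3.PPO
    model = algo_cls("MlpPolicy", env_id, verbose=0)
    model.learn(total_timesteps=total_timesteps)
    # Evaluate on the environment the model was trained on.
    return evaluate_policy(model, model.get_env(), n_eval_episodes=10)


if __name__ == "__main__":
    for name in ALGORITHMS:
        mean_reward, std_reward = train_and_evaluate(name)
        print(f"{name}: {mean_reward:.1f} +/- {std_reward:.1f}")
```

With a budget this small the numbers are noisy, so treat them as a first pass for deciding which method to look into further, not as a real benchmark.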