Practical_RL
labml
Practical_RL | labml
---|---
2 | 23
5,681 | 1,811
1.1% | 2.2%
6.5 | 9.8
15 days ago | 8 days ago
Jupyter Notebook | Jupyter Notebook
The Unlicense | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Practical_RL
- Alternatives to OpenAI’s Spinning Up?
There is this great GitHub repo with lectures and other resources, plus week-by-week Jupyter notebooks where they explain and code, with homework at the very end. It covers the basics and deep RL, though only DQN and DDPG/PPO, but I think it will give you a good start in the topic before you start working on your own.
labml
- [D] Why doesn’t your team use an experiment tracking tool?
- [D] How do you guys tune hyperparameters, when a single training run takes a long time (days to weeks)?
- [P] Annotated deep learning paper implementations
labmlai/labml is a set of tools (experiment tracking, configurations, a bunch of helpers) we coded to ease our ML work (which we later improved and open sourced). So we use it in all our projects because it makes things easier for us.
- React's UI State Model vs. Vanilla JavaScript
- [D] I'm new and scrappy. What tips do you have for better logging and documentation when training or hyperparameter training?
- [P][D] Dynamic Hyper-parameters
The call lr() will return the current learning rate set in the labml.ai app.
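The idea in that post, a hyperparameter read through a call such as lr() so that its value can be changed while training runs, can be sketched roughly as follows. This is a minimal illustration of the pattern only, not labml's actual API: the class name `DynamicHyperParam`, its `set` method, the range clipping, and the default value are all hypothetical, and no app connection is modeled.

```python
import threading


class DynamicHyperParam:
    """Hypothetical stand-in for a hyperparameter that can change mid-run."""

    def __init__(self, default, value_range):
        self._lock = threading.Lock()
        self._low, self._high = value_range
        self._value = self._clip(default)

    def _clip(self, value):
        # Keep the value inside the allowed range.
        return max(self._low, min(self._high, value))

    def set(self, value):
        # Would be called by whatever receives updates, e.g. a UI callback.
        with self._lock:
            self._value = self._clip(value)

    def __call__(self):
        # Calling the object returns the current value.
        with self._lock:
            return self._value


# Assumed default and range, chosen only for illustration.
lr = DynamicHyperParam(2.5e-4, (0.0, 1e-3))


def train_step(step):
    # A real loop would pass this value to the optimizer on every step.
    return lr()


rates = [train_step(0)]
lr.set(1e-4)   # simulated update arriving from outside the loop
rates.append(train_step(1))
lr.set(5.0)    # out of range, so it is clipped to the upper bound
rates.append(train_step(2))
```

Reading the value through a call on every step, rather than copying it once at start-up, is what lets an external change take effect without restarting the run.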
What are some alternatives?
nn - 🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
webdataset - A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
guildai - Experiment tracking, ML developer tools
FunMatch-Distillation - TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
awesome-rl - Reinforcement learning resources curated
tensorflow-onnx - Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
MIRNet-TFJS - TensorFlow JS models for MIRNet for low-light💡 image enhancement
redisai-examples - RedisAI showcase
Deep-Learning-Push-Up-Counter - Deep Learning approach to count the number of repetitions in a video of push ups or pull ups.
TensorFlow-Tutorials - TensorFlow Tutorials with YouTube Videos
Lottery_Ticket_Hypothesis-TensorFlow_2 - Implementing "The Lottery Ticket Hypothesis" paper by "Jonathan Frankle, Michael Carbin"