webdataset
Practical_RL
Our great sponsors
webdataset | Practical_RL | |
---|---|---|
7 | 2 | |
1,962 | 5,716 | |
7.4% | 1.2% | |
8.8 | 6.0 | |
17 days ago | 18 days ago | |
Python | Jupyter Notebook | |
BSD 3-clause "New" or "Revised" License | The Unlicense |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
webdataset
-
How to use data stored in a (private) S3 Bucket for training?
As an alternative, I've looked into using WebDataset, but couldn't figure out how to access data that is stored in a private bucket.
- [D] Title: Best tools and frameworks for working with million-billion image datasets?
-
[D] Training networks on extremely large datasets (10+TB)?
You can try webdataset (https://github.com/webdataset/webdataset).
- Question: TIFF image dataset - size in RAM.
-
How to upload large amounts of data to a server?
compress it to .tar format and then load it as a webdataset
-
Does mit 6.824 help for distributed deep learning?
Would guess not but there should be some good niche resources: check out the introductory videos here https://github.com/webdataset/webdataset
-
How to effectively load a large text dataset with PyTorch?
I found a pretty good solution that is similar to the TFRecord from Tensorflow. You just need to load the data, tokenized it, and save the arrays in shards with webdataset package.
Practical_RL
- [D] implementation of MCTS in Python
-
Alternatives to OpenAI’s spinning up?
there is this great github repo where there are lectures and other resources, and have a week by week jupyter notebooks where they explain and code with homeworks at the very end of it. is basics and deepRL, but just dqn and DDPG/ppo but i think will give you good start in the topic for later star working on your own.
What are some alternatives?
NYU-DLSP20 - NYU Deep Learning Spring 2020
FunMatch-Distillation - TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
Made-With-ML - Learn how to design, develop, deploy and iterate on production-grade ML applications.
awesome-rl - Reinforcement learning resources curated
ffcv - FFCV: Fast Forward Computer Vision (and other ML workloads!)
labml - 🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱
fastai - The fastai deep learning library
alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
ModelNet40-C - Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296
redisai-examples - RedisAI showcase
PySyft - Perform data science on data that remains in someone else's server
TensorFlow-Tutorials - TensorFlow Tutorials with YouTube Videos