blog
Practical_RL
blog | Practical_RL | |
---|---|---|
5 | 2 | |
2,011 | 5,716 | |
5.0% | 0.5% | |
9.8 | 6.0 | |
4 days ago | 21 days ago | |
Jupyter Notebook | Jupyter Notebook | |
- | The Unlicense |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
blog
-
Refact LLM: New 1.6B code model reaches 32% HumanEval and is SOTA for the size
[4] https://github.com/huggingface/blog/blob/main/starcoder.md
-
A comprehensive guide to running Llama 2 locally
If you just want to do inference/mess around with the model and have a 16GB GPU, then this[0] is enough to paste into a notebook. You need to have access to the HF models though.
0. https://github.com/huggingface/blog/blob/main/llama2.md#usin...
-
Let’s train your first Offline Decision Transformer model from scratch 🤖
The hands-on 👉https://github.com/huggingface/blog/blob/main/notebooks/101_train-decision-transformers.ipynb
-
How to switch to half precision fp16?
I'm also running the optimized script but it doesn't run with 512x512 on my RTX3050 Ti mobile. On this website, they recommend to switch to fp16 for GPUs with less than 10gb of vram.
-
Are people hiding their deep learning code?
Here's a notebook illustrating how to train a language model from scratch: https://github.com/huggingface/blog/blob/master/notebooks/01_how_to_train.ipynb
Practical_RL
- [D] implementation of MCTS in Python
-
Alternatives to OpenAI’s spinning up?
there is this great github repo where there are lectures and other resources, and have a week by week jupyter notebooks where they explain and code with homeworks at the very end of it. is basics and deepRL, but just dqn and DDPG/ppo but i think will give you good start in the topic for later star working on your own.
What are some alternatives?
text-generation-inference - Large Language Model Text Generation Inference
webdataset - A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
FunMatch-Distillation - TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.
awesome-notebooks - A powerful data & AI notebook templates catalog: prompts, plugins, models, workflow automation, analytics, code snippets - following the IMO framework to be searchable and reusable in any context.
awesome-rl - Reinforcement learning resources curated
QuantumKatas - Tutorials and programming exercises for learning Q# and quantum computing
alpha-zero-general - A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
stable-diffusion - Optimized Stable Diffusion modified to run on lower GPU VRAM
labml - 🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱
FinMind - Open Data, more than 50 financial data. 提供超過 50 個金融資料(台股為主),每天更新 https://finmind.github.io/
redisai-examples - RedisAI showcase