halutmatmul
PyTorch-Guide
Metric | halutmatmul | PyTorch-Guide
---|---|---
Mentions | 3 | 2
Stars | 202 | 23
Growth | - | -
Activity | 9.4 | 1.8
Last commit | 5 months ago | over 2 years ago
Language | Python | Python
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
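The page does not give the formula behind the activity number, so the following is only a minimal sketch of how a recency-weighted, percentile-based score of this kind could be computed. The function names, the exponential decay, and the 30-day half-life are all assumptions made for illustration, not the site's actual method.

```python
def recency_weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commit weights so that recent commits count more than older ones.

    Assumption: a commit loses half its weight every `half_life_days` days.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_scores(raw_scores):
    """Map raw scores to a 0-10 scale by percentile rank among tracked projects.

    A project gets 9.0 when 90% of the tracked projects have a strictly lower
    raw score, which places it in the top 10%.
    """
    n = len(raw_scores)
    result = {}
    for name, score in raw_scores.items():
        below = sum(1 for other in raw_scores.values() if other < score)
        result[name] = round(10.0 * below / n, 1)
    return result


# Hypothetical data: ten projects whose raw scores are 1.0 through 10.0.
tracked = {f"project_{i}": float(i) for i in range(1, 11)}
scores = activity_scores(tracked)
```

Here `scores["project_10"]` is 9.0 because nine of the ten projects score lower. A percentile rank keeps the number relative, as the description above says, so a project's score can drop when other tracked projects become more active even if its own commit rate is unchanged.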
halutmatmul
- Show HN: Stella Nera – Maddness Hardware Accelerator
- 10x faster matrix and vector operations
This master's thesis sort of does it, but it doesn't have any fine-tuning yet so it completely wrecks the accuracy: https://github.com/joennlae/halutmatmul.
If someone worked on contributing this to Composer [1] I'd be down to help out. I can't justify building it all on my own right now since we're 100% focused on training speedup, but I could definitely meet and talk through it, help code tricky parts, review PRs, etc.
[1] https://github.com/mosaicml/composer
PyTorch-Guide
- Useful Tools and Programs for Deep Learning with PyTorch
- Cool PyTorch Guide/Wiki
PyTorch Guide/Wiki: https://github.com/mikeroyal/PyTorch-Guide
What are some alternatives?
QualityScaler - image/video deep learning upscaling for any GPU
NeuralCDE - Code for "Neural Controlled Differential Equations for Irregular Time Series" (NeurIPS 2020 Spotlight)
kernel_tuner - Kernel Tuner
cog - Containers for machine learning
3d-ken-burns - an implementation of 3D Ken Burns Effect from a Single Image using PyTorch
bittensor - Internet-scale Neural Networks
composer - Supercharge Your Model Training
TransformerEngine - A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
bolt - 10x faster matrix and vector operations
caer - High-performance Vision library in Python. Scale your research, not boilerplate.