autocvd vs TransformerEngine
Metric | autocvd | TransformerEngine
---|---|---
Mentions | 1 | 2
Stars | 1 | 1,450
Growth | - | 6.4%
Activity | 4.4 | 9.5
Last commit | 11 months ago | 6 days ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
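The definitions above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the site does not publish its formulas, so the exponential half-life weighting, the percentile mapping, and every function name and input number below are hypothetical. The only thing taken from the text is the behavior it describes: growth is month over month, recent commits count more, and a score of 9.0 corresponds to the top 10% of tracked projects.

```python
from bisect import bisect_left
from typing import List, Optional


def month_over_month_growth(stars_now: int, stars_last_month: int) -> Optional[float]:
    """Star growth in percent, or None when there is no usable baseline."""
    if stars_last_month <= 0:
        return None
    return (stars_now - stars_last_month) / stars_last_month * 100.0


def raw_activity(commit_ages_days: List[float], half_life_days: float = 30.0) -> float:
    """Sum of commit weights; a commit's weight halves every `half_life_days`.

    Assumption: exponential decay is just one way to weight recent commits
    more heavily than older ones.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw: float, all_raw_scores: List[float]) -> float:
    """Map a raw score to a 0-10 scale by percentile rank among tracked projects."""
    ranked = sorted(all_raw_scores)
    below = bisect_left(ranked, raw)  # projects with a strictly lower raw score
    return round(10.0 * below / len(ranked), 1)


if __name__ == "__main__":
    # Hypothetical star counts, chosen only to illustrate the arithmetic.
    print(month_over_month_growth(1450, 1363))
    # A project that outranks 9 of 10 tracked projects sits in the top 10%.
    print(activity_score(10.0, [float(x) for x in range(1, 11)]))
```

With this mapping, a project whose raw score exceeds that of 90% of tracked projects lands at 9.0, which matches the example in the text.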
Posts mentioning TransformerEngine:
- Benchmarking Large Language Models on NVIDIA H100 GPUs with CoreWeave (Part 1)
  The 4090 now has its 8-bit float enabled as well; see the [transformer engine issue](https://github.com/NVIDIA/TransformerEngine/issues/15).
- GPUs for Deep Learning in 2023 – An In-depth Analysis
  Would be curious to see your benchmarks. Btw, Nvidia will be providing support for fp8 in a future release of CUDA: https://github.com/NVIDIA/TransformerEngine/issues/15
  I think TMA may not matter as much for consumer cards, given the disproportionate amount of fp32 / int32 compute that they have.
  Would be interesting to see how close to theoretical peak folks are able to get once CUDA support comes through.
What are some alternatives?
g2eth - Get mining specs on given GPUs
Whisper - High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
autocvd-go - A golang cli tool for setting `CUDA_VISIBLE_DEVICES` based on GPU utilization.
warp-drive - Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
foolbox - A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
ivy - The Unified AI Framework
best-of-ml-python - 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.
Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials - A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
fastaudio - 🔊 Audio and fastai v2
d2l-en - Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
liberate-fhe - A Fully Homomorphic Encryption (FHE) library for bridging the gap between theory and practice with a focus on performance and accuracy.
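
Several of the tools above, like autocvd itself, set `CUDA_VISIBLE_DEVICES` based on GPU utilization. A minimal sketch of that idea follows. It is not the implementation of autocvd or autocvd-go: the function names and the ranking rule (lowest utilization first, then lowest memory use) are assumptions made for illustration. The only external dependency is the `nvidia-smi` CLI with its `--query-gpu` CSV output.

```python
import os
import subprocess
from typing import List

# nvidia-smi query that prints one "index, utilization, memory" row per GPU.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,utilization.gpu,memory.used",
    "--format=csv,noheader,nounits",
]


def rank_gpus(csv_text: str) -> List[int]:
    """Return GPU indices ordered from least to most busy."""
    rows = []
    for line in csv_text.strip().splitlines():
        try:
            index, util, mem_used = (int(field.strip()) for field in line.split(","))
        except ValueError:
            continue  # skip malformed rows or fields reported as "[N/A]"
        rows.append((util, mem_used, index))
    return [index for _, _, index in sorted(rows)]


def select_gpus(count: int = 1) -> str:
    """Query nvidia-smi and expose only the `count` least-busy GPUs."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    chosen = ",".join(str(i) for i in rank_gpus(result.stdout)[:count])
    os.environ["CUDA_VISIBLE_DEVICES"] = chosen
    return chosen
```

`select_gpus()` would need to run before a framework such as PyTorch initializes CUDA, since the variable is read when the CUDA runtime starts up.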