Our great sponsors
-
aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
-
tkDNN
Deep neural network library and toolkit to do high performace inference on NVIDIA jetson platforms
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Qualcomm AIMET may help you
Develop your own tkDNN solution. Try it out and improve it. And improve your hardware if possible.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- Eagle 7B: Soaring past Transformers
- [R] RWKV: Reinventing RNNs for the Transformer Era
- 4096 Context length (and beyond)
- rwkv.cpp: FP16 & INT4 inference on CPU for RWKV language model (r/MachineLearning)
- I was looking for some great quantization open-source libraries that could actually be applied in production (both edge or cloud CPU/GPU). Do you know if I am missing any good libraries?