tritony
budgetml
tritony | budgetml | |
---|---|---|
1 | 4 | |
38 | 1,333 | |
- | 0.2% | |
6.4 | 0.0 | |
5 months ago | 3 months ago | |
Python | Python | |
BSD 3-clause "New" or "Revised" License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tritony
-
Are you using `Triton Inference Server`?
Check it https://github.com/rtzr/tritony !
budgetml
What are some alternatives?
vllm - A high-throughput and memory-efficient inference and serving engine for LLMs
pinferencia - Python + Inference - Model Deployment library in Python. Simplest model inference server ever.
DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
zenml - ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
quick-deploy - Optimize, convert and deploy machine learning models as fast inference API using Triton and ORT. Currently support Hugging Face transformers, PyToch, Tensorflow, SKLearn and XGBoost models.
onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
serving-compare-middleware - FastAPI middleware for comparing different ML model serving approaches
ck - Collective Mind (CM) is a small, modular, cross-platform and decentralized workflow automation framework with a human-friendly interface and reusable automation recipes to make it easier to build, run, benchmark and optimize AI, ML and other applications and systems across diverse and continuously changing models, data, software and hardware
ColossalAI - Making large AI models cheaper, faster and more accessible
fastapi-template - Completely Scalable FastAPI based template for Machine Learning, Deep Learning and any other software project which wants to use Fast API as an API framework.
experta - Expert Systems for Python
FastAPI-template - Feature rich robust FastAPI template.