examples
kfserving
examples | kfserving | |
---|---|---|
1 | 1 | |
66 | 2,113 | |
- | - | |
9.1 | 10.0 | |
2 days ago | about 1 year ago | |
Jupyter Notebook | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
examples
kfserving
-
How do we assign pods properly so that KFServing can scale down GPU Instances to zero?
We are using KFServing as well. KFServing allows us to auto-scale our GPU up and down, specifically scaling to zero when its not in use. The components in KFServing also get assigned to GPU nodes when applying them to our cluster.
What are some alternatives?
Traffic-Lights-Classification-CNN - Coding and testing a convolutional neural network for classifying traffic lights, with Keras and Tensorflow, using LISA dataset.
soopervisor - ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.
ee_repository_stats - Scraping, parsing, and analyzing every public Earth Engine repository
kserve - Standardized Serverless ML Inference Platform on Kubernetes
ststats - UK Specialty Training Stats
mosec - A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
Ordinal_Classifier - Introduce order in your classification within 1 line
inferencedb - 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)
multi-object-tracking-in-python - 📡 implementation of multi object tracking algorithms including PMBM (Poisson Multi Bernoulli Mixture filter) in Python 🐍
BentoML - The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
quick-deploy - Optimize, convert and deploy machine learning models as fast inference API using Triton and ORT. Currently support Hugging Face transformers, PyToch, Tensorflow, SKLearn and XGBoost models.
community - Information about the Kubeflow community including proposals and governance information.