community
kfserving
community | kfserving | |
---|---|---|
1 | 1 | |
151 | 2,113 | |
0.7% | - | |
7.7 | 10.0 | |
3 days ago | about 1 year ago | |
Jsonnet | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
community
-
How can we read variables from file and use them in promql?
However I am not able to figure it out, how can I feed the string xyz_stack_1 to grafana. I have setup docker-compose.yaml file to start up all the containers. The configuration is done through prometheus.yaml, grafana.ini, dashboards.yaml and datasources.yaml
kfserving
-
How do we assign pods properly so that KFServing can scale down GPU Instances to zero?
We are using KFServing as well. KFServing allows us to auto-scale our GPU up and down, specifically scaling to zero when its not in use. The components in KFServing also get assigned to GPU nodes when applying them to our cluster.
What are some alternatives?
kserve - Standardized Serverless ML Inference Platform on Kubernetes
soopervisor - ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.
pipelines - Machine Learning Pipelines for Kubeflow
couler - Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
mosec - A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
elyra - Elyra extends JupyterLab with an AI centric approach.
examples - 📝 Examples of how to use Neptune for different use cases and with various MLOps tools
prometheus - A docker-compose stack for Prometheus monitoring
inferencedb - 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)
kubeflow - Machine Learning Toolkit for Kubernetes
BentoML - The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!