hongbomiao.com
kfserving
Metric | hongbomiao.com | kfserving
---|---|---
Mentions | 1 | 1
Stars | 199 | 2,113
Growth | - | -
Activity | 10.0 | 10.0
Last commit | 5 days ago | about 1 year ago
Language | HCL | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hongbomiao.com
OPToggles - a feature flag Open Source project
This should cover some of it: https://www.youtube.com/watch?v=1_Iz0tRQCH4 There's this demo app from a Tesla engineer using OPA+OPAL, and I've also been writing some blog posts on the topic for my company's blog - there's still not much there, but I keep adding more every couple of weeks, so you can check that out as well!
kfserving
How do we assign pods properly so that KFServing can scale down GPU Instances to zero?
We are using KFServing as well. KFServing allows us to auto-scale our GPU up and down, specifically scaling to zero when it's not in use. The components in KFServing also get assigned to GPU nodes when applying them to our cluster.
What are some alternatives?
Yatai - Model Deployment at Scale on Kubernetes 🦄️
soopervisor - ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.
Realtime-MLOps - A framework of open-source technologies to design real-time machine learning systems
kserve - Standardized Serverless ML Inference Platform on Kubernetes
KubeScript - Kubernetes meets Typescript
mosec - A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
create-react-native-dapp - Your next Ethereum application starts here. ⚛️ 💪 🦄
examples - 📝 Examples of how to use Neptune for different use cases and with various MLOps tools
awesome-argo - A curated list of awesome projects and resources related to Argo (a CNCF graduated project)
inferencedb - 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)
BentoML - The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
community - Information about the Kubeflow community including proposals and governance information.