How do we assign pods properly so that KFServing can scale down GPU Instances to zero?

This page summarizes the projects mentioned and recommended in the original post on /r/codehunter

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • kfserving

    Discontinued Standardized Serverless ML Inference Platform on Kubernetes [Moved to: https://github.com/kserve/kserve]

  • We are using KFServing as well. KFServing allows us to auto-scale our GPU up and down, specifically scaling to zero when its not in use. The components in KFServing also get assigned to GPU nodes when applying them to our cluster.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • ArdEEG shield for Arduino UNO R4 to measure EEG, EMG, and ECG bio-signals

    1 project | news.ycombinator.com | 3 May 2024
  • Open-Source Fault Tree Analysis (Osfta)

    1 project | news.ycombinator.com | 3 May 2024
  • Ray: Unified framework for scaling AI and Python applications

    1 project | news.ycombinator.com | 3 May 2024
  • MPK MK3 x Key 25 x Arturia M.2

    1 project | /r/ableton | 11 Feb 2022
  • Firefox Webserial Addon

    2 projects | news.ycombinator.com | 3 May 2024