[N] Easily profile FastAPI model serving

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • graphsignal-python

    Graphsignal Tracer for Python

  • We've added a simple way to profile any model serving endpoint, including FastAPI, to identify bottlenecks and make inference (incl. data processing) faster, especially for big models and data. Wanted to share it here in case someone is struggling with profiling and monitoring of deployed code and models. By default, generic Python profiler will automatically profile some of the inferences (and measure all inferences). You can also specify other profilers for PyTorch, TensorFlow, Jax and ONNX Runtime. All profiles and metrics will be available on the SaaS dashboard, no need to setup anything. A couple of links to get started: Repo: https://github.com/graphsignal/graphsignal FastAPI example: https://graphsignal.com/docs/integrations/fastapi/ Happy for any feedback!

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Show HN: Python Monitoring for LLMs, OpenAI, Inference, GPUs

    2 projects | news.ycombinator.com | 4 Apr 2023
  • Show HN: Python Monitoring for AI: LLMs, OpenAI, Inference, GPUs

    1 project | news.ycombinator.com | 29 Mar 2023
  • Show HN: Python Monitoring for AI: LLMs, OpenAI, Inference, GPUs

    2 projects | news.ycombinator.com | 28 Mar 2023
  • [N] Monitor OpenAI API Latency, Tokens, Rate Limits, and More with Graphsignal

    1 project | /r/MachineLearning | 31 Jan 2023
  • Monitor OpenAI API Latency, Tokens, Rate Limits, and More

    1 project | news.ycombinator.com | 31 Jan 2023