Multi-model serving options

This page summarizes the projects mentioned and recommended in the original post on /r/mlops

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • jina

    ☁️ Build multimodal AI applications with cloud-native stack

    Jina let’s you serve all of your models through the same Gateway while deploying them as individual microservices. You can also tie your models together in a pipeline if needed. Also some nice ML focussed features such as dynamic batching.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • server

    The Triton Inference Server provides an optimized cloud and edge inferencing solution. (by triton-inference-server)

    You've already mentioned Seldon Core which is well worth looking at but if you're just after the raw multi-model serving aspect rather than a fully-fledged deployment framework you should maybe take a look at the individual inference servers: Triton Inference Server and MLServer both support multi-model serving for a wide variety of frameworks (and custom python models). MLServer might be a better option as it has an MLFlow runtime but only you will be able to decide that. There also might be other inference servers that do MMS that I'm not aware of.

  • MLServer

    An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

    You've already mentioned Seldon Core which is well worth looking at but if you're just after the raw multi-model serving aspect rather than a fully-fledged deployment framework you should maybe take a look at the individual inference servers: Triton Inference Server and MLServer both support multi-model serving for a wide variety of frameworks (and custom python models). MLServer might be a better option as it has an MLFlow runtime but only you will be able to decide that. There also might be other inference servers that do MMS that I'm not aware of.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Jina.ai: Self-host Multimodal models

    1 project | news.ycombinator.com | 26 Jan 2024
  • Recommend a Lightweight Launcher with Nested Folders

    1 project | /r/androidapps | 26 Mar 2023
  • How can we match images in our database?

    2 projects | /r/learnmachinelearning | 16 Mar 2023
  • Can AI 3D model search engines be a thing this year?

    1 project | /r/blender | 26 Feb 2023
  • [P] Open-source Neural Search framework to implement semantic search & multimedia search. Just released 2.0, seeking your feedback.

    6 projects | /r/MachineLearning | 3 Jul 2021