10 Must-Know Open Source Platform Engineering Tools for AI/ML Workflows

This page summarizes the projects mentioned and recommended in the original post on dev.to

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
  1. kubeflow

    Machine Learning Toolkit for Kubernetes

    Kubeflow is a Kubernetes-native, open source platform that simplifies ML workflow management on Kubernetes. It handles the complexities of containerization and supports end-to-end pipeline automation and distributed training on large datasets, making it ideal for production-grade ML systems.

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. feast

    The Open Source Feature Store for AI/ML

    Unlike the other tools, Feast solves a different issue: the management of ML feature data. Feast simplifies the features management by storing and managing the code used to generate machine learning features, and facilitates the deployment of these features into production. Typically, it integrates with your data sources to streamline management.

  4. prometheus

    The Prometheus monitoring system and time series database.

    Prometheus handles everything related to alerting and monitoring your metrics. As an open source monitoring platform tool, it allows AI developers and ML engineers to gain insights into their Infrastructures, create custom dashboards, and monitor their ML workflows in real time.

  5. MLflow

    Open source platform for the machine learning lifecycle

    MLflow provides developers with comprehensive tools for managing the entire ML lifecycle. Its four primary components—tracking, models, projects, and model registry—facilitate efficient, reproducible, and scalable ML pipeline building.

  6. dvc

    🦉 Data Versioning and ML Experiments

    Data Version Control is a powerful version control tool tailored for ML workflows. It ensures reproducibility by tracking and sharing data, pipelines, experiments, and models. With its Git-like interface, it integrates seamlessly with existing Git repositories. It supports various cloud storage like AWS S3 and Azure Blob, thus enabling versioning of large datasets without bloating your Git repositories.

  7. Airflow

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    Apache Airflow offers simplicity when it comes to scheduling, authoring, and monitoring ML workflows using Python. The tool's greatest advantage is its compatibility with any system or process you are running. This also eliminates manual intervention and increases team productivity, which aligns with the principles of Platform Engineering tools.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Top 10 MLOps Tools for 2025

    5 projects | dev.to | 5 Nov 2024
  • [D] What’s the simplest, most lightweight but complete and 100% open source MLOps toolkit? -> MY OWN CONCLUSIONS

    16 projects | /r/MachineLearning | 30 Mar 2021
  • Data Version Control

    5 projects | news.ycombinator.com | 19 Oct 2024
  • 10 Open Source Tools for Building MLOps Pipelines

    9 projects | dev.to | 6 Jun 2024
  • FLaNK 15 Jan 2024

    21 projects | dev.to | 15 Jan 2024