mlrun vs dagster-example-pipeline

mlrun

MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications. (by mlrun)

Source Code

mlrun.org

Suggest alternative

Edit details

dagster-example-pipeline

Template Dagster repo using poetry and a single Docker container; works well with CICD (by MileTwo)

dagster Poetry Python Data Science data-engineering

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

mlrun		dagster-example-pipeline
	Project
3	Mentions	1
1,294	Stars	64
6.0%	Growth	-
9.9	Activity	0.0
2 days ago	Latest Commit	about 2 years ago
Python	Language	Python
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

mlrun

Posts with mentions or reviews of mlrun. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-28.

Discussion on Need of Feature Stores
1 project | /r/mlops | 17 Jul 2022
I reviewed 50+ open-source MLOps tools. Here’s the result
3 projects | /r/mlops | 28 May 2022

You should also add MLRun: https://github.com/mlrun/mlrun
Has anyone here been able to deploy Mlrun successfully on Kubernetes cluster?
1 project | /r/mlops | 9 Feb 2021

dagster-example-pipeline

Posts with mentions or reviews of dagster-example-pipeline. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-25.

Developing in Dagster
2 projects | dev.to | 25 Mar 2022

The associated code repo can be found here

What are some alternatives?

When comparing mlrun and dagster-example-pipeline you can also consider the following projects:

feast - Feature Store for Machine Learning

Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]

flyte - Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

AWS Data Wrangler - pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

SmartSim - SmartSim Infrastructure Library.

Prefect - The easiest way to build, run, and monitor data pipelines at scale.

phidata - Build AI Assistants with memory, knowledge and tools.

canarypy - CanaryPy - A light and powerful canary release for Data Pipelines

mosec - A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

portable-data-stack-dagster - A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset

ploomber - The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

aws-data-wrangler - pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL). [Moved to: https://github.com/aws/aws-sdk-pandas]