-
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
GitHub: https://github.com/flyteorg/flyte
We have integrated Flyte with tools including Spark, BigQuery, MPI, SageMaker, Great Expectations, and Pandera. I recently built an Airflow provider for Flyte that triggers Flyte workflows from within Airflow; this is useful if you want to build ETL pipelines in Airflow and machine learning pipelines in Flyte, and run the two together.
NOTE:
The number of mentions on this list counts mentions in common posts plus user-suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
-
Flyte 1.10: Self-hosted solution to build production-grade data and ML pipelines; now ships with monorepo, new agents and sensors, eager workflows and more 🚀 (4.1k stars on GitHub)
-
Flyte: Open-source orchestrator for building production-grade ML pipelines
-
Flyte: Advanced workflow orchestration alternative to Apache Airflow
-
Flyte 1.6.0: Self-hosted solution to build production-grade data and ML pipelines; now ships with PyTorch elastic training, image specification without dockerfile, enhanced task execution insights and more 🚀 (3.4k stars on GitHub)
-
Flyte: Open-Source Kubernetes-Native ML Orchestrator Implemented in Go