Top 18 kubeflow Open-Source Projects
-
couler
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
-
deployKF
deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform.
-
hongbomiao.com
🦋 A personal research and development (R&D) lab that facilitates the sharing of knowledge.
-
Kubernetes-Guide
Kubernetes Guide. Learn all about Kubernetes monitoring, networking, and containers, whether you're running Kubernetes locally or in the cloud (Azure, AWS, and GCP).
-
awesome-kubeflow
A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)
-
community
Information about the Kubeflow community including proposals and governance information. (by kubeflow)
-
Fast-Kubeflow
This repo covers Kubeflow Environment with LABs: Kubeflow GUI, Jupyter Notebooks on pods, Kubeflow Pipelines, Experiments, KALE, KATIB (AutoML: Hyperparameter Tuning), KFServe (Model Serving), Training Operators (Distributed Training), Projects, etc.
-
soopervisor
☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.
Project mention: [Discussion] Guidance on training ML models on Kubernetes | /r/MachineLearning | 2023-05-24
You could use https://github.com/kubeflow/training-operator directly.
author seems to be describing the kind of patterns you might make with https://argoproj.github.io/argo-workflows/ . or see for example https://github.com/couler-proj/couler , which is an sdk for describing tasks that may be submitted to different workflow engines on the backend.
it's a little confusing to me that the author seems to object to "pipelines" and then equate them with messaging-queues. for me at least, "pipeline" vs "workflow-engine" vs "scheduler" are all basically synonyms in this context. those things may or may not be implemented with a message-queue for persistence, but the persistence layer itself is usually below the level of abstraction that $current_problem is really concerned with. like the author says, eventually you have to track state/timestamps/logs, but you get that from the beginning if you start with a workflow engine.
i agree with author that message-queues should not be a knee-jerk response to most problems because the level of effort (LoE) for edge-cases/observability/monitoring is huge. (maybe reach for a queue only if you may actually overwhelm whatever the "scheduler" can handle.) but don't build the scheduler from scratch either. use argowf, kubeflow, or a more opinionated framework like airflow, mlflow, databricks, aws lambda or step-functions. all/any of these should have config or api that's robust enough to express rate-limit/retry stuff. almost any of these choices has better observability out-of-the-box than you can easily get from a queue. but most importantly, they provide idioms for handling failure that data-science folks and junior devs can work with. the right way to structure code is just much clearer, and things like structuring messages/events, subclassing workers, and repeating/retrying tasks are just harder to mess up.
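To make the "rate-limit/retry stuff" in the comment above concrete, here is a minimal Python sketch of the retry-with-backoff logic that a workflow engine typically lets you declare in config instead of writing by hand. It is not taken from any of the projects listed here: `RetryPolicy`, `run_with_retry`, and their parameters are hypothetical names chosen for illustration.

```python
import time
from dataclasses import dataclass
from typing import Callable, TypeVar

T = TypeVar("T")


@dataclass
class RetryPolicy:
    # Hypothetical knobs; real engines expose similar settings under their own names.
    max_attempts: int = 3
    base_delay_s: float = 0.5
    backoff_factor: float = 2.0


def run_with_retry(
    task: Callable[[], T],
    policy: RetryPolicy,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run `task`, retrying on any exception with exponential backoff."""
    delay = policy.base_delay_s
    for attempt in range(1, policy.max_attempts + 1):
        try:
            return task()
        except Exception:
            if attempt == policy.max_attempts:
                raise  # out of attempts: surface the last error to the caller
            sleep(delay)  # wait before the next attempt
            delay *= policy.backoff_factor
    raise ValueError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the sketch testable without real waiting. With a workflow engine, the equivalent of `RetryPolicy` usually lives in the task definition, and the engine also records state, timestamps, and logs for each attempt.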
Project mention: We are excited to announce the release of deployKF! It's an open-source project that makes it actually easy to deploy and maintain Kubeflow (and more) on Kubernetes. | /r/Kubeflow | 2023-08-12
Project mention: How can we read variables from file and use them in promql? | /r/grafana | 2023-05-29
However I am not able to figure it out, how can I feed the string xyz_stack_1 to grafana. I have setup docker-compose.yaml file to start up all the containers. The configuration is done through prometheus.yaml, grafana.ini, dashboards.yaml and datasources.yaml
Kubeflow is an ML platform like Sagemaker or Databricks that you can self-host in a Kubernetes cluster.
Installing/deploying it is as complicated as it sounds, but we've put together an infrastructure project that lets you '1-click' install it even in tiny environments.
The GH repo (also linked in blog) allows you to start Kubeflow in a codespace or small device using a docker container -- this is both good for trying it out and developing it into your own internal ML platform.
https://github.com/treebeardtech/kubeflow-helm
kubeflow related posts
-
We are excited to announce the release of deployKF! It's an open-source project that makes it actually easy to deploy and maintain Kubeflow (and more) on Kubernetes.
-
awslabs/data-on-eks: DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKS
-
Fast-Kubeflow: Kubeflow Tutorial, Sample Usage Scenarios (Howto: Hands-on LAB)
Index
What are some of the best open-source kubeflow projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | kubeflow | 13,759 |
2 | pipelines | 3,457 |
3 | kserve | 3,111 |
4 | elyra | 1,780 |
5 | training-operator | 1,477 |
6 | couler | 891 |
7 | arena | 710 |
8 | data-on-eks | 517 |
9 | mpi-operator | 403 |
10 | deployKF | 275 |
11 | hongbomiao.com | 202 |
12 | Kubernetes-Guide | 188 |
13 | awesome-kubeflow | 182 |
14 | kfp-tekton | 163 |
15 | community | 152 |
16 | Fast-Kubeflow | 72 |
17 | soopervisor | 43 |
18 | kubeflow-bootstrap | 20 |