Kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular. (by kedro-org)

Kedro Alternatives

Similar projects and alternatives to Kedro

  1. Hugo

    593 Kedro VS Hugo

    The world’s fastest framework for building websites.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. PRAW

    528 Kedro VS PRAW

    PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.

  4. black

    340 Kedro VS black

    The uncompromising Python code formatter

  5. streamlit

    310 Kedro VS streamlit

    Streamlit — A faster way to build and share data apps.

  6. Docusaurus

    308 Kedro VS Docusaurus

    Easy to maintain open source documentation websites.

  7. transformers

    212 Kedro VS transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  8. rich

    157 Kedro VS rich

    Rich is a Python library for rich text and beautiful formatting in the terminal.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. Mage

    79 Kedro VS Mage

    🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai

  11. MLflow

    75 Kedro VS MLflow

    Open source platform for the machine learning lifecycle

  12. flyte

    39 Kedro VS flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

  13. zenml

    35 Kedro VS zenml

    ZenML 🙏: The bridge between ML and Ops. https://zenml.io.

  14. featureform

    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

  15. metaflow

    28 Kedro VS metaflow

    Build, Manage and Deploy AI/ML Systems

  16. cookiecutter-data-science

    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

  17. projects

    19 Kedro VS projects

    Sample projects using Ploomber. (by ploomber)

  18. trio

    19 Kedro VS trio

    Trio – a friendly Python library for async concurrency and I/O

  19. huey

    14 Kedro VS huey

    a little task queue for python

  20. Airflow

    188 Kedro VS Airflow

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

  21. nextflow

    9 Kedro VS nextflow

    A DSL for data-driven computational pipelines

  22. DurableTask

    Durable Task Framework allows users to write long running persistent workflows in C# using the async/await capabilities.

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Kedro alternative or higher similarity.

Kedro discussion

Log in or Post with

Kedro reviews and mentions

Posts with mentions or reviews of Kedro. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-11-13.
  • 20 Open Source Tools I Recommend to Build, Share, and Run AI Projects
    11 projects | dev.to | 13 Nov 2024
    Kedro is an ML development framework that brings data science projects from pilot development to production by creating reproducible, maintainable, and modular data science code. Kedro has a data catalog for data handling, support pipeline building, and a standardized template for code maintainability and consistency to effectively do this. Its data catalog uses lightweight data connectors to manage and track datasets. This allows you to use the same pipeline to build multiple production-level codes across your system.
  • Kedro – An open-source framework for data science code
    1 project | news.ycombinator.com | 17 Aug 2024
  • 10 Open Source MLOps Projects You Didn’t Know About
    12 projects | dev.to | 1 Aug 2024
    Kedro A serious problem with machine learning projects is the complex process involved in taking models from development to production. Kedro is an open source tool that solves this problem by employing software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
  • 25 Open Source AI Tools to Cut Your Development Time in Half
    8 projects | dev.to | 11 Jul 2024
    Kedro is an ML development framework for creating reproducible, maintainable, modular data science code. Kedro improves AI project development experience via data abstraction and code organization. Using lightweight data connectors, it provides a centralized data catalog to manage and track datasets throughout a project. This enables data scientists to focus on building production level code through Kedro's data pipelines, enabling other stakeholders to use the same pipelines in different parts of the system.
  • Nextflow: Data-Driven Computational Pipelines
    9 projects | news.ycombinator.com | 10 Aug 2023
    Interesting, thanks for sharing. I'll definitely take a look, although at this point I am so comfortable with Snakemake, it is a bit hard to imagine what would convince me to move to another tool. But I like the idea of composable pipelines: I am building a tool (too early to share) that would allow to lay Snakemake pipelines on top of each other using semi-automatic data annotations similar to how it is done in kedro (https://github.com/kedro-org/kedro).
  • A Polars exploration into Kedro
    6 projects | dev.to | 17 May 2023
    # pyproject.toml [project] dependencies = [ "kedro @ git+https://github.com/kedro-org/kedro@3ea7231", "kedro-datasets[pandas.CSVDataSet,polars.CSVDataSet] @ git+https://github.com/kedro-org/kedro-plugins@3b42fae#subdirectory=kedro-datasets", ]
  • What are some open-source ML pipeline managers that are easy to use?
    7 projects | /r/mlops | 3 May 2023
    So there's 2 sides to pipeline management: the actual definition of the pipelines (in code) and how/when/where you run them. Some tools like prefect or airflow do both of them at once, but for the actual pipeline definition I'm a fan of https://kedro.org. You can then use most available orchestrators to run those pipelines on whatever schedule and architecture you want.
  • How do data scientists combine Kedro and Databricks?
    1 project | dev.to | 19 Apr 2023
    We have set up a milestone on GitHub so you can check in on our progress and contribute if you want to. To suggest features to us, report bugs, or just see what we're working on right now, visit the Kedro projects on GitHub.
  • How do you organize yourself during projects?
    1 project | /r/learnmachinelearning | 28 Mar 2023
    you could use a project framework like kedro to force you to be more disciplined about how you structure your projects. I'd also recommend checking out this book: Edna Ridge - Guerrilla Analytics: A Practical Approach to Working with Data
  • Futuristic documentation systems in Python, part 1: aiming for more
    3 projects | dev.to | 14 Mar 2023
    Recently I started a position as Developer Advocate for Kedro, an opinionated data science framework, and one of the things we're doing is exploring what are the best open source tools we can use to create our documentation.
  • A note from our sponsor - SaaSHub
    www.saashub.com | 24 May 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic Kedro repo stats
33
10,342
9.4
7 days ago

kedro-org/kedro is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of Kedro is Python.


Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?