make-booster VS Airflow

Compare make-booster and Airflow and see how they differ.

             make-booster        Airflow
Mentions     3                   170
Stars        8                   34,705
Growth       -                   1.7%
Activity     10.0                10.0
Last commit  almost 2 years ago  5 days ago
Language     Makefile            Python
License      MIT License         Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

make-booster

Posts with mentions or reviews of make-booster. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-15.
  • Snakemake – A framework for reproducible data analysis
    6 projects | news.ycombinator.com | 15 Jul 2023
    For a very different approach, check out make-booster:

    https://github.com/david-a-wheeler/make-booster

    Make-booster provides utility routines intended to greatly simplify data processing (particularly a data pipeline) using GNU make. It includes some mechanisms specifically to help Python, as well as general-purpose mechanisms that can be useful in any system. In particular, it helps reliably reproduce results, and it automatically determines what needs to run and runs only that (producing a significant speedup in most cases). Released as open source software.

  • A Love Letter to Make
    5 projects | news.ycombinator.com | 20 Apr 2023
    https://github.com/david-a-wheeler/make-booster

    I think a lot of hate on make is due to poor use. If your makefile is complex, refactor it. Auto-generate dependencies (it only takes a few lines in GNU make). And don't use recursive make, that way lies madness. I also think GNU make is the wiser tool; POSIX make lacks too much in many cases.

  • The Unreasonable Effectiveness of Makefiles
    12 projects | news.ycombinator.com | 12 Aug 2022
    https://github.com/david-a-wheeler/make-booster

    From its readme:

    "This project (contained in this directory and below) provides utility routines intended to greatly simplify data processing (particularly a data pipeline) using GNU make. It includes some mechanisms specifically to help Python, as well as general-purpose mechanisms that can be useful in any system. In particular, it helps reliably reproduce results, and it automatically determines what needs to run and runs only that (producing a significant speedup in most cases)."

    "For example, imagine that Python file BBB.py says include CC, and file CC.py reads from file F.txt (and CC.py declares its INPUTS= as described below). Now if you modify file F.txt or CC.py, any rule that runs BBB.py will automatically be re-run in the correct order when you use make, even if you didn't directly edit BBB.py."

    This is NOT functionality directly provided by Python, and the overhead with >1000 files was 0.07 seconds, which we could live with :-).
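
The rerun rule quoted from the readme above (editing F.txt or CC.py should re-run anything that runs BBB.py) can be illustrated with a small Python sketch. This is not make-booster's implementation, which is built on GNU make; it only shows the general idea of a timestamp check over transitive dependencies. The function names, the `deps` and `mtimes` dictionaries, and the output file `out.csv` are all hypothetical, and the timestamps are plain numbers rather than real file modification times.

```python
def transitive_inputs(name, deps):
    """Return everything `name` depends on, directly or indirectly."""
    seen = set()
    stack = list(deps.get(name, ()))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(deps.get(node, ()))
    return seen


def needs_rerun(output, script, deps, mtimes):
    """True if `output` is missing, or older than `script` or any of its inputs."""
    if output not in mtimes:
        return True
    inputs = {script} | transitive_inputs(script, deps)
    return any(mtimes[i] > mtimes[output] for i in inputs)


# Hypothetical dependency graph mirroring the readme example:
# BBB.py uses CC.py, and CC.py reads F.txt.
deps = {"BBB.py": ["CC.py"], "CC.py": ["F.txt"]}

# Hypothetical timestamps: the output was built after every input.
mtimes = {"BBB.py": 100, "CC.py": 100, "F.txt": 100, "out.csv": 200}
print(needs_rerun("out.csv", "BBB.py", deps, mtimes))  # False

# Touching F.txt makes the output stale even though BBB.py was not edited.
mtimes["F.txt"] = 300
print(needs_rerun("out.csv", "BBB.py", deps, mtimes))  # True
```

In make itself the same effect comes from listing the indirect inputs as prerequisites of the rule; the sketch only makes the transitive check explicit.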

Airflow

Posts with mentions or reviews of Airflow. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-07.

What are some alternatives?

When comparing make-booster and Airflow you can also consider the following projects:

tclmake - Partial make clone in pure Tcl

Kedro - Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

checkexec - CLI tool to conditionally execute commands only when files in a dependency list have been updated. Like `make`, but standalone.

dagster - An orchestration platform for the development, production, and observation of data assets.

snakemake-wrappers - This is the development home of the Snakemake wrapper repository, see

n8n - Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

mandala - A powerful and easy to use Python framework for experiment tracking and incremental computing

luigi - Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

dagger - Application Delivery as Code that Runs Anywhere

Apache Spark - A unified analytics engine for large-scale data processing

just - 🤖 Just a command runner

Dask - Parallel computing with task scheduling