Alternatives to nextflow?

This page summarizes the projects mentioned and recommended in the original post on /r/bioinformatics

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
InfluxDB high-performance time series database
Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
influxdata.com
featured
  1. nextflow

    A DSL for data-driven computational pipelines

    So just for some context, I'm coming at this from an angle of a developer, not really of a bioinformatician. First of all, linting in my opinion, is really lacking. Sure the nf-core toolset lints, but that's more so for checking that your code-base follows nf-core guidlelines. It does not flag bugs or inconsistencies with your workflows, or processes. There's really no language support (syntax, language rules, etc.) you would expect to find with other DSL's or programming languages in general. Furthermore, while the script section error messages within a process are decently informative, most errors don't really help. An example is like the one in this github issue.

  2. Judoscale

    Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.

    Judoscale logo
  3. awesome-pipeline

    A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

    Hi everyone. So I've been using nextflow for about a month or so, having developed a few pipelines and I've found the debugging experience absolutely abysmal. Although nextflow has great observability with tower, and great community support with nf-core, the uninformative error messages is souring the experience for me. There are soooo many pipeline frameworks out there, but I'm wondering if anyone has come across one similar to nextflow in offering observability, a strong community behind it, multiple executors (container image based preferably) and an awesome debugging experience? I would favor a python based approach, but not sure snakemake is the one I'm looking for.

  4. genopype

    Architecture for creating bash pipelines, in particular, for bioinformatics

    I wrote a package for my needs called GenoPype that is basically what you're describing. https://github.com/jolespin/genopype I don't claim that it's better software than Snakemake or Nextflow, I just developed something that works for exactly what I need it to do in creating log files, intermediate directories, checkpoints, i/o checks, acceptable return codes, etc.

  5. ploomber

    The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

    It really depends on your use cases, I've seen a lot of those tools that lock you into a certain syntax, framework or weird language (for instance Groovy). If you'd like to use core python or Jupyter notebooks I'd recommend Ploomber, the community support is really strong, there's an emphasis on observability and you can deploy it on any executor like Slurm, AWS Batch or Airflow. In addition, there's a free managed compute (cloud edition) where you can run certain bioinformatics flows like Alphafold or Cripresso2

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Any good resources for R for bioinformatics

    2 projects | /r/labrats | 17 Nov 2022
  • Airflow's Problem

    6 projects | news.ycombinator.com | 2 Aug 2022
  • Feel very hard writing nextflow pipeline.

    2 projects | /r/bioinformatics | 11 May 2022
  • [Discussion] Applied machine learning implementation debate. Is OOP approach towards data preprocessing in python an overkill?

    3 projects | /r/MachineLearning | 3 Nov 2021
  • what do you think about airflow?

    2 projects | /r/dataengineering | 2 Oct 2021