mag
rnaseq
Metric | mag | rnaseq
---|---|---
Mentions | 2 | 14
Stars | 179 | 758
Growth | 6.7% | 4.9%
Activity | 9.5 | 9.5
Last commit | 7 days ago | 7 days ago
Language | Nextflow | Nextflow
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
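The page does not publish the formula behind the activity number, so the following is only a sketch of how such a score could be built from the description above. It assumes an exponential decay for commit weights (the `half_life_days` parameter is hypothetical) and maps each project's raw score to a 0-10 scale by its percentile rank among tracked projects, so that 9.0 corresponds to the top 10%.

```python
from datetime import date


def weighted_commit_score(commit_dates, today, half_life_days=30.0):
    """Sum of per-commit weights; a commit's weight halves every half_life_days.

    The decay shape and the half-life are assumptions, not the site's formula.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_scores(raw_scores):
    """Map raw scores to a 0-10 scale by percentile rank among all projects."""
    n = len(raw_scores)
    result = {}
    for name, score in raw_scores.items():
        # Count how many tracked projects have a strictly lower raw score.
        below = sum(1 for other in raw_scores.values() if other < score)
        result[name] = round(10.0 * below / n, 1)
    return result


# Ten hypothetical projects with raw scores 1..10.
raw = {f"project{i}": float(i) for i in range(1, 11)}
scores = activity_scores(raw)
```

Under these assumptions the most active of the ten projects beats nine others and scores 9.0, which matches the "top 10%" reading in the text.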
mag
- How do you use Termux?
So, I'm a bioinformatician, and currently I am extending my company's data processing pipeline, which takes metagenomic sequencing data and performs a number of downstream analyses on it; the results are then provided to our customers. Here is an example of such a data processing pipeline: it's a script which orchestrates the execution of a number of tools to get the desired end result. Each of the green bubbles in the flowchart represents a single program which needs to be called with specific settings and with data provided to it either by me (as the pipeline input) or from the outputs of previous steps in the pipeline.
- Is it possible to assemble a complete bacterial genome using short reads?
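The orchestration idea described in the Termux post above (each program receives either the pipeline input or the outputs of earlier steps) can be sketched roughly as follows. This is a minimal illustration, not how nf-core/mag or Nextflow work internally: the `Step` class, the `run_pipeline` helper, and the step names are all hypothetical, and plain functions stand in for calls to external programs.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Step:
    name: str
    needs: List[str]         # names of earlier outputs this step consumes
    run: Callable[..., str]  # stands in for calling an external program


def run_pipeline(steps: List[Step], pipeline_input: str) -> Dict[str, str]:
    """Run steps in the given order, feeding each one the outputs it needs."""
    outputs = {"input": pipeline_input}
    for step in steps:
        missing = [n for n in step.needs if n not in outputs]
        if missing:
            raise ValueError(f"step {step.name!r} is missing inputs: {missing}")
        outputs[step.name] = step.run(*(outputs[n] for n in step.needs))
    return outputs


# Hypothetical three-step pipeline; each lambda just records what it was given.
steps = [
    Step("qc", ["input"], lambda reads: f"qc({reads})"),
    Step("assembly", ["qc"], lambda reads: f"assembly({reads})"),
    Step("binning", ["assembly", "qc"],
         lambda contigs, reads: f"binning({contigs}, {reads})"),
]
results = run_pipeline(steps, "reads.fastq")
```

In a real pipeline each `run` callable would invoke a tool with its settings; workflow managers such as Nextflow additionally handle scheduling, caching, and containers, which this sketch leaves out.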
rnaseq
- R pipelines for bulk RNA-seq analyses
- What are some good examples of well-engineered bioinformatics pipelines?
- Generate GUIs and deploy bioinformatics workflows with python
First, let's recognize that the framework presented has new features that don't exist in the previous DSLs you mention. Many developers highly value these additions, and they alone could justify a new stab at a workflow language; for many they represent a tradeoff worth making:
* Interface generation
* Declarative cloud resource provisioning
* Static typing
* Native Python support
This workflow has a similar level of complexity to nf-core/rnaseq (not the same, but similar in the number of constituent tasks for the purpose of counting transcript abundance). It ingests raw sequencing reads, runs QC + trimming, does pseudo-alignment, recovers counts from abundance estimates, and aggregates counts over many samples for direct use by differential-expression tools. (It is not 'running salmon'. I think that is a reductionist take.) It does this in addition to dynamically building React.js interfaces, adding static type validation to input parameters, and deploying cloud infrastructure in a simpler way.
As for the lines-of-code comparison, I think it is a weird way to compare software, as the number of lines of code is no proxy for legibility, ease of development, or likelihood of long-term maintenance (many more people know Python than Nextflow). Nonetheless, nf-core/rnaseq has nearly 1000 lines in its workflow entrypoint alone - https://github.com/nf-core/rnaseq/blob/master/workflows/rnaseq.nf . With imported modules + subworkflows, LOC actually reaches the many thousands. (Now, I understand it is more complex and mature, but I highlight this to show why I think the comparison is weird, and I wonder what you are even comparing to.) Whereas the entire logic of the presented pipeline is actually neatly encapsulated in 1200 lines of a single file.
Overall, this feels like it doesn't come from a place of rational discourse, but rather from group dislike for something new and different. What I would like to do is address and talk about specific technical points (preferably over issues on GitHub) so conversations can be productive and improve the tools I am working on.
- I've been really frustrated with picking the right tools for bulk RNA-seq, so I did a long literature review and wrote this workflow
- Software repository and hackathons
- Introduction to RNAseq and microRNA?
- Tkinter for python 3.10 broken on MacOS?
Not really sure why it's a problem for you. I'm working on rnaseq, which takes a very big input dataset and also outputs huge datasets. It uses Docker, so you can deploy quickly on VMs.
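One step mentioned in the "Generate GUIs and deploy bioinformatics workflows with python" post above is aggregating counts over many samples for downstream differential-expression tools, with validated input parameters. A rough sketch of that step is shown here; it is not the code of the framework discussed or of nf-core/rnaseq, and the `SampleCounts` class and `aggregate` function are hypothetical names. Python type hints are not enforced at runtime, so the checks in `__post_init__` are explicit runtime validation rather than static typing.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class SampleCounts:
    sample: str
    counts: Dict[str, int]  # gene identifier -> read count

    def __post_init__(self):
        # Runtime validation; the annotations alone would not reject bad input.
        if not self.sample:
            raise ValueError("sample name must be non-empty")
        for gene, n in self.counts.items():
            if not isinstance(n, int) or n < 0:
                raise ValueError(f"invalid count for {gene!r}: {n!r}")


def aggregate(samples: List[SampleCounts]) -> Dict[str, List[int]]:
    """Build a gene-by-sample matrix; genes absent from a sample count as 0."""
    genes = sorted({g for s in samples for g in s.counts})
    return {g: [s.counts.get(g, 0) for s in samples] for g in genes}


# Hypothetical per-sample counts.
samples = [
    SampleCounts("a", {"g1": 5, "g2": 0}),
    SampleCounts("b", {"g1": 3, "g3": 7}),
]
matrix = aggregate(samples)
```

Each row of `matrix` lists one gene's counts in sample order, which is the general shape that count-based differential-expression tools expect as input.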
What are some alternatives?
SqueezeMeta - A complete pipeline for metagenomic analysis
atlas - A modern tool for managing database schemas
nextflow - A DSL for data-driven computational pipelines
spades - SPAdes Genome Assembler
sarek - Analysis pipeline to detect germline or somatic variants (pre-processing, variant calling and annotation) from WGS / targeted sequencing
diffexpr - Porting DESeq2 into python via rpy2
sage - Proteomics search & quantification so fast that it feels like magic
HomeBrew - 🍺 The missing package manager for macOS (or Linux)
eager - A fully reproducible and state-of-the-art ancient DNA analysis pipeline
configs - Config files used to define parameters specific to compute environments at different Institutions
Atlas - 🚀 An open and lightweight modification to Windows, designed to optimize performance, privacy and security.