Bioinformatics

Open-source projects categorized as Bioinformatics

Top 23 Bioinformatics Open-Source Projects

  • cs-video-courses

    List of Computer Science courses with video lectures.

  • Project mention: Need advice | /r/PAK | 2023-07-12

    Computer science is a very vast field, but the fundamentals remain the same: learn the basics, data structures, and the concepts of object-oriented programming.

  • dash

    Data Apps & Dashboards for Python. No JavaScript Required.

  • Project mention: dash VS solara - a user suggested alternative | libhunt.com/r/dash | 2023-10-13
  • Biopython

    Official git repository for Biopython (originally converted from CVS)

  • Project mention: Invitation to a project - Biopython in Spanish | /r/devsarg | 2023-07-23
  • deepvariant

    DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

  • Project mention: Look over my purchase, is there anything I should return? | /r/buildapc | 2023-05-06
  • awesome-single-cell

    Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

  • Awesome-Bioinformatics

    A curated list of awesome Bioinformatics libraries and software.

  • Project mention: Experience with DNA analysis? | /r/Slovakia | 2023-05-31
  • nextflow

    A DSL for data-driven computational pipelines

  • Project mention: Nextflow: Data-Driven Computational Pipelines | news.ycombinator.com | 2023-08-10

    > It's been a while since you can rerun/resume Nextflow pipelines

    Yes, you can resume, but you need your whole upstream DAG to be present. Snakemake can rerun a job when only the dependencies of that job are present, which makes it possible to manage disk usage neatly, or to archive an intermediate state of a project and rerun things from there.

    > and yes, you can have dry runs in Nextflow

    You have stubs, which really isn't the same thing.

    > I have no idea what you're referring to with the 'arbitrary limit of 1000 parallel jobs' though

    I was referring to this issue: https://github.com/nextflow-io/nextflow/issues/1871. However, the discussion doesn't do the issue full justice. Nextflow spawns each job in a separate thread, and when it tries to spawn 1000+ condor jobs it dies with a cryptic error message. The workaround of setting -Dnxf.pool.type=sync and -Dnxf.pool.maxThreads=N prevents resuming, and Nextflow attempts to rerun the pipeline instead.

    > As for deleting temporary files, there are features that allow you to do a few things related to that, and other features being implemented.

    There are some hacks for this - but nothing I would feel safe to integrate into a production tool. They are implementing something - you're right - and it's been the case for several years now, so we'll see.

    Snakemake has all that out of the box.
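The distinction drawn in the comment above, between needing only a job's direct inputs and needing the whole upstream DAG, can be sketched with a toy model. This is only an illustration under stated assumptions: the job names (`trim`, `align`, `count`) and both policy functions are hypothetical, and the code does not reproduce the actual scheduling logic of Nextflow or Snakemake.

```python
# Toy model only: NOT the scheduling logic of Nextflow or Snakemake.
# Each (hypothetical) job maps to the jobs whose outputs it consumes directly.
DAG = {
    "trim": [],
    "align": ["trim"],
    "count": ["align"],
}


def upstream(job, dag):
    """Return every direct and transitive dependency of `job`."""
    seen = set()
    stack = list(dag[job])
    while stack:
        dep = stack.pop()
        if dep not in seen:
            seen.add(dep)
            stack.extend(dag[dep])
    return seen


def can_rerun_with_direct_deps(job, dag, present):
    """Policy A: only the job's direct inputs must still be on disk."""
    return all(dep in present for dep in dag[job])


def can_rerun_with_full_upstream(job, dag, present):
    """Policy B: the outputs of every upstream job must still be on disk."""
    return upstream(job, dag) <= present


# Suppose the output of "trim" was deleted to save disk space.
present = {"align"}
print(can_rerun_with_direct_deps("count", DAG, present))    # True
print(can_rerun_with_full_upstream("count", DAG, present))  # False
```

Under policy A, intermediate outputs can be deleted or archived once their direct consumers have run, which is the disk-usage advantage the commenter attributes to Snakemake.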

  • fastp

    An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

  • Project mention: R pipelines for bulk RNA-seq analyses | /r/bioinformatics | 2023-12-09

    fastp + MultiQC + Salmon + DESeq2, all in a Nextflow workflow. It is a good (and not complicated) exercise to create the pipeline from scratch the first time, to properly understand each tool.
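As a rough sketch of what building such a pipeline from scratch might involve, the snippet below only assembles the per-sample command lines for fastp, Salmon, and MultiQC and prints them; nothing is executed. The sample name, file paths, index directory, and the `build_commands` helper are all hypothetical, the options shown are a minimal subset, and the DESeq2 step (which runs in R) is left out.

```python
import shlex


def build_commands(sample, r1, r2, salmon_index, outdir="results"):
    """Return the commands for one paired-end sample as argument lists.

    Hypothetical helper: it only builds the command lines, it runs nothing.
    """
    trimmed_r1 = f"{outdir}/{sample}_R1.trimmed.fastq.gz"
    trimmed_r2 = f"{outdir}/{sample}_R2.trimmed.fastq.gz"

    # Read trimming and QC with fastp (paired-end input and output).
    fastp = [
        "fastp",
        "-i", r1, "-I", r2,
        "-o", trimmed_r1, "-O", trimmed_r2,
        "--json", f"{outdir}/{sample}.fastp.json",
        "--html", f"{outdir}/{sample}.fastp.html",
    ]

    # Transcript quantification with Salmon; "-l A" asks Salmon to infer
    # the library type automatically.
    salmon = [
        "salmon", "quant",
        "-i", salmon_index,
        "-l", "A",
        "-1", trimmed_r1, "-2", trimmed_r2,
        "-o", f"{outdir}/{sample}_salmon",
    ]

    # Aggregate the reports found under the output directory with MultiQC.
    multiqc = ["multiqc", outdir, "-o", f"{outdir}/multiqc"]

    return [fastp, salmon, multiqc]


for cmd in build_commands(
    "sampleA", "sampleA_R1.fastq.gz", "sampleA_R2.fastq.gz", "salmon_index"
):
    print(shlex.join(cmd))
```

Printing the commands first makes it easy to inspect each step before wrapping them in a workflow manager such as Nextflow.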

  • scanpy

    Single-cell analysis in Python. Scales to >1M cells.

  • Project mention: Renaming Genes for Scanpy Plot | /r/bioinformatics | 2023-06-15
  • minimap2

    A versatile pairwise aligner for genomic and spliced nucleotide sequences

  • Project mention: Ask HN: Comment here about whatever you're passionate about at the moment | news.ycombinator.com | 2023-11-06

    Interested as well! But the future is not so dark; things like https://github.com/lh3/minimap2 are a breath of fresh air.

  • scispacy

    A full spaCy pipeline and models for scientific/biomedical documents.

  • bioconda-recipes

    Conda recipes for the bioconda channel.

  • Rust-Bio

    This library provides implementations of many algorithms and data structures that are useful for bioinformatics. All provided implementations are rigorously tested via continuous integration.

  • Project mention: Bioinformatics Data Structures in Rust | news.ycombinator.com | 2023-08-23
  • bwa

    Burrows-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

  • Project mention: my friend showed me his code, thees are all functions | /r/ProgrammerHumor | 2023-05-25

    Check out these sweet macros (from the same repo): https://github.com/lh3/bwa/blob/139f68fc4c3747813783a488aef2adc86626b01b/kbtree.h

  • galaxy

    Data intensive science for everyone.

  • Project mention: Need for GUIs for bioinformatic tools? | /r/bioinformatics | 2023-06-17

    Maybe it would help you to look at the galaxy project: GitHub main site

  • seqtk

    Toolkit for processing sequences in FASTA/Q formats

  • Project mention: Illumina adapters and quality trimming | /r/bioinformatics | 2023-07-04

    seqtk: A lightweight and versatile tool for processing FASTQ and FASTA files. https://github.com/lh3/seqtk

  • MMseqs2

    MMseqs2: ultra fast and sensitive search and clustering suite

  • seqkit

    A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

  • Project mention: A look at the Mojo language for bioinformatics | news.ycombinator.com | 2024-02-11

    I've been thinking of learning Rust for these use cases, but always get frustrated with the complexity.

    I find Go is a great middle ground though! And now there are starting to be a few more bio-related tools and toolkits out there, including:

    - https://github.com/vertgenlab/gonomics

    - https://github.com/biogo/biogo

    - https://github.com/shenwei356/bio

    ... apart from there being some really popular bio tools written in Go, like:

    - https://github.com/shenwei356/seqkit

  • MultiQC

    Aggregate results from bioinformatics analyses across many samples into a single report.

  • Project mention: R pipelines for bulk RNA-seq analyses | /r/bioinformatics | 2023-12-09

    fastp + MultiQC + Salmon + DESeq2, all in a Nextflow workflow. It is a good (and not complicated) exercise to create the pipeline from scratch the first time, to properly understand each tool.

  • deep_gcns_torch

    PyTorch repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000 (ICML'2021): https://www.deepgcns.org

  • getting-started-with-genomics-tools-and-resources

    Unix, R and Python tools for genomics and data science

  • scipipe

    Robust, flexible and resource-efficient pipelines using Go and the commandline

  • react-plotly.js

    A plotly.js React component from Plotly 📈

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Bioinformatics projects? This list will help you:

Rank Project Stars
1 cs-video-courses 64,788
2 dash 20,472
3 Biopython 4,158
4 deepvariant 3,076
5 awesome-single-cell 2,898
6 Awesome-Bioinformatics 2,874
7 nextflow 2,538
8 fastp 1,760
9 scanpy 1,745
10 minimap2 1,684
11 scispacy 1,612
12 bioconda-recipes 1,565
13 Rust-Bio 1,494
14 bwa 1,443
15 galaxy 1,313
16 seqtk 1,308
17 MMseqs2 1,246
18 seqkit 1,202
19 MultiQC 1,161
20 deep_gcns_torch 1,104
21 getting-started-with-genomics-tools-and-resources 1,058
22 scipipe 1,054
23 react-plotly.js 990
