Top 21 C++ Bioinformatic Projects

fastp

9 1,751 3.5 C++

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

Project mention: R pipelines for bulk RNA-seq analyses | /r/bioinformatics | 2023-12-09

fastp + multiQC + Salmon + DESeq2 all some nextflow workflow. It is a good exercise (not complicated) to create the pipeline from scratch the first time to properly understand each tool.
salmon

1 725 5.0 C++

🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment (by COMBINE-lab)
InfluxDB

www.influxdata.com
sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
bwa-mem2

2 673 2.6 C++

The next version of bwa-mem
bowtie2

2 615 7.6 C++

A fast and sensitive gapped read aligner

Project mention: NHI Genome Studies: Mexico Govt Sept 12 Congressional hearing | /r/genetics | 2023-09-14

2) Use bowtie2 to align reads against CHM13. This will let you separate human from nonhuman (important, as human sequences are a common contaminant in many nonhuman genomes).
megahit

1 554 0.0 C++

Ultra-fast and memory-efficient (meta-)genome assembler
nanopolish

1 539 5.2 C++

Signal-level algorithms for MinION data
edlib

2 484 1.1 C++

Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.
WorkOS

workos.com
sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
hap.py

2 391 0.0 C++

Haplotype VCF comparison tools
seqan3

1 385 8.9 C++

The modern C++ library for sequence analysis. Contains version 3 of the library and API docs.
octopus

1 295 4.0 C++

Bayesian haplotype-based mutation calling (by luntergroup)
bowtie

1 252 10.0 C++

An ultrafast memory-efficient short read aligner
amr

1 232 7.5 C++

AMRFinderPlus - Identify AMR genes and point mutations, and virulence and stress resistance genes in assembled bacterial nucleotide and protein sequence.
GenomicSQLite

1 152 3.4 C++

Genomics Extension for SQLite
rnaseqc

1 140 4.2 C++

Fast, efficient RNA-Seq metrics for quality control and process optimization
SnakeStrike

1 86 0.0 C++

A Low-cost Open-source High-speed Multi-camera Motion Capture System.
TileDB-VCF

4 79 8.3 C++

Efficient variant-call data storage and retrieval library using the TileDB storage library.
sshash

1 77 7.3 C++

A compressed, associative, exact, and weighted dictionary for k-mers.
Metabuli

1 62 9.1 C++

Metabuli: specific and sensitive metagenomic classification via joint analysis of DNA and amino acid.

Project mention: Genomic sequence classification based on k-mers distribution | /r/bioinformatics | 2023-12-04
IntaRNA

1 44 6.6 C++

Efficient target prediction incorporating accessibility of interaction sites
pRIblast

1 2 4.7 C++

pRIblast is a high efficient, parallel application for extensive lncRNA-RNA interaction analysis
kmer-signatures

1 0 3.6 C++

High-performance kmer-signatures
SaaSHub

www.saashub.com
sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-12-09.

C++ Bioinformatics related posts

Ask HN: Comment here about whatever you're passionate about at the moment
17 projects | news.ycombinator.com | 6 Nov 2023
NHI Genome Studies: Mexico Govt Sept 12 Congressional hearing
4 projects | /r/genetics | 14 Sep 2023
Illumina's Manta candidateSmallIndels.vcf.gz Fed Into Illumina's Strelka Using What Sample's candidateSmallIndels.vcf.gz Experimental/Tumor or Normal?
2 projects | /r/bioinformatics | 20 Jun 2023
What to do with Manta outputs
2 projects | /r/bioinformatics | 23 May 2023
Help running hap.py
1 project | /r/bioinformatics | 22 Nov 2022
Anyone use DRAGEN-GATK?
1 project | /r/bioinformatics | 12 Oct 2022
Tools for strand direction detection RNA-Seq
2 projects | /r/bioinformatics | 8 Oct 2022
A note from our sponsor - InfluxDB
www.influxdata.com | 17 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Bioinformatic projects in C++? This list will help you:

	Project	Stars
1	fastp	1,751
2	salmon	725
3	bwa-mem2	673
4	bowtie2	615
5	megahit	554
6	nanopolish	539
7	edlib	484
8	hap.py	391
9	seqan3	385
10	octopus	295
11	bowtie	252
12	amr	232
13	GenomicSQLite	152
14	rnaseqc	140
15	SnakeStrike	86
16	TileDB-VCF	79
17	sshash	77
18	Metabuli	62
19	IntaRNA	44
20	pRIblast	2
21	kmer-signatures	0