Python Genomics

Open-source Python projects categorized as Genomics

Top 23 Python Genomic Projects

  1. Biopython

    Official git repository for Biopython (originally converted from CVS)

    Project mention: How to Start Contributing to Open Source Software | dev.to | 2024-10-17

    I also like contributing specifically to my field. As a PhD student and possibly future scientist, I have a vested interest in the quality of the software in my field, specifically structural bioinformatics. I use several tools in this field and often find areas that can be improved, both for myself and others. As an example, consider this minor documentation change I added to the Biopython documentation.

  3. deepvariant

    DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

  4. galaxy

    Data intensive science for everyone.

  5. Hail

    Cloud-native genomic dataframes and batch computing

    Project mention: Ask HN: Who is hiring? (October 2024) | news.ycombinator.com | 2024-10-01
  6. ncbi-genome-download

    Scripts to download genomes from the NCBI FTP servers

  7. pyCirclize

    Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)

    Project mention: Circular Data Visualization in Python | news.ycombinator.com | 2024-07-09
  8. goatools

    Python library to handle Gene Ontology (GO) terms

  10. eggnog-mapper

    Fast genome-wide functional annotation through orthology assignment

  11. nucleotide-transformer

    🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

    Project mention: Nucleotide Transformer: building robust foundation models for human genomics | news.ycombinator.com | 2024-12-07

    Any given segment of DNA within a gene can be classified as ignored, as transcribed into RNA and spliced into another segment, as a signal to start or end a splice, as a signal for the start or end of the protein, as a regulatory switch for other parts of DNA, and more...

    This LLM takes a DNA sequence and assigns probabilities for each of those classifications: https://github.com/instadeepai/nucleotide-transformer

  12. enformer-pytorch

    Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

  13. pyfaidx

    Efficient pythonic random access to fasta subsequences

  14. DNA-Diffusion

    🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨

  15. truvari

    Structural variant toolkit for VCFs

  16. pyGenomeViz

    A genome visualization python package for comparative genomics

  17. hgvs

    Python library to parse, format, validate, normalize, and map sequence variants. `pip install hgvs`

  18. Scoary

    Pan-genome wide association studies

  19. ariba

    Antimicrobial Resistance Identification By Assembly

  20. veba

    A modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes

  21. ClipKIT

    A multiple sequence alignment-trimming algorithm for accurate phylogenomic inference

  22. covid-19-genomes

    Projects on COVID-19 topic of genomic sequencing - mostly DataViz

  23. UPIMAPI

    UniProt Id Mapping through API

  24. arcsv

    Complex structural variant detection from WGS data

  25. enrich_omics

    A Python package to explore pathways, diseases and drugs associated with a list of targets (genes, proteins, etc.)

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Genomics related posts

  • Project invitation - Biopython in Spanish

    1 project | /r/devsarg | 23 Jul 2023
  • Snakemake – A framework for reproducible data analysis

    6 projects | news.ycombinator.com | 15 Jul 2023
  • Biopython – Python Tools for Computational Molecular Biology

    1 project | news.ycombinator.com | 28 May 2023
  • comparing the similarity between a set of protein sequences

    2 projects | /r/genomics | 12 May 2023
  • Look over my purchase, is there anything I should return?

    2 projects | /r/buildapc | 6 May 2023
  • u/Responsible-Gas3852 comments on "Why is Cancer so Hard to Cure?"

    2 projects | /r/bestof | 13 Apr 2023
  • My boss is considering letting me take a programming course if I have some good reasons why.

    2 projects | /r/labrats | 13 Apr 2023

Index

What are some of the best open-source Genomic projects in Python? This list will help you:

# Project Stars
1 Biopython 4,524
2 deepvariant 3,341
3 galaxy 1,464
4 Hail 997
5 ncbi-genome-download 988
6 pyCirclize 833
7 goatools 808
8 eggnog-mapper 596
9 nucleotide-transformer 579
10 enformer-pytorch 465
11 pyfaidx 466
12 DNA-Diffusion 382
13 truvari 344
14 pyGenomeViz 312
15 hgvs 260
16 Scoary 190
17 ariba 172
18 veba 83
19 ClipKIT 70
20 covid-19-genomes 52
21 UPIMAPI 31
22 arcsv 28
23 enrich_omics 19
