NHI Genome Studies: Mexico Govt Sept 12 Congressional hearing

This page summarizes the projects mentioned and recommended in the original post on /r/genetics.

  • fastp

    An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

  • 1) QC the data with fastp. This'll trim out adapters and toss reads that are poor quality.

  • bowtie2

    A fast and sensitive gapped read aligner

  • 2) Use bowtie2 to align reads against CHM13. This will let you separate human from nonhuman (important, as human sequences are a common contaminant in many nonhuman genomes).

  • CHM13

    The complete sequence of a human genome

  • kraken2

    The second version of the Kraken taxonomic sequence classification system

  • 3) Use Kraken2 to classify remaining reads. I'd start with the standard database.
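The three steps above could be chained roughly as follows. This is only a sketch under several assumptions that the original post does not state: the reads are paired-end gzipped FASTQ, a bowtie2 index for CHM13 has already been built with bowtie2-build, and a Kraken2 standard database has already been downloaded. All file names, the index name `chm13`, the database path `k2_standard`, and the helper function names are hypothetical. Note that `--un-conc-gz` keeps pairs that did not align concordantly to the human reference, which is one reasonable reading of "nonhuman" but not the only one.

```python
import shlex
import subprocess


def fastp_cmd(r1, r2, prefix, threads=4):
    """Step 1: trim adapters and drop low-quality reads with fastp."""
    return [
        "fastp",
        "-i", r1, "-I", r2,
        "-o", f"{prefix}.trim.R1.fastq.gz",
        "-O", f"{prefix}.trim.R2.fastq.gz",
        "--detect_adapter_for_pe",          # infer adapters for paired-end data
        "--json", f"{prefix}.fastp.json",
        "--html", f"{prefix}.fastp.html",
        "--thread", str(threads),
    ]


def bowtie2_cmd(index, prefix, threads=4):
    """Step 2: align to CHM13; pairs that fail to align concordantly are kept."""
    return [
        "bowtie2", "-p", str(threads), "-x", index,
        "-1", f"{prefix}.trim.R1.fastq.gz",
        "-2", f"{prefix}.trim.R2.fastq.gz",
        # bowtie2 replaces '%' with 1 or 2 to name the per-mate output files
        "--un-conc-gz", f"{prefix}.nonhuman.R%.fastq.gz",
        "-S", f"{prefix}.human.sam",
    ]


def kraken2_cmd(db, prefix, threads=4):
    """Step 3: classify the remaining (non-human) reads with Kraken2."""
    return [
        "kraken2", "--db", db, "--threads", str(threads),
        "--paired", "--gzip-compressed",
        "--report", f"{prefix}.kraken2.report",
        "--output", f"{prefix}.kraken2.out",
        f"{prefix}.nonhuman.R1.fastq.gz",
        f"{prefix}.nonhuman.R2.fastq.gz",
    ]


def build_pipeline(r1, r2, prefix, index, db, threads=4):
    """Return the three commands in the order they should run."""
    return [
        fastp_cmd(r1, r2, prefix, threads),
        bowtie2_cmd(index, prefix, threads),
        kraken2_cmd(db, prefix, threads),
    ]


def run_pipeline(cmds, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in cmds:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # stop at the first failing step
```

Calling `run_pipeline(build_pipeline("sample_R1.fastq.gz", "sample_R2.fastq.gz", "sample", "chm13", "k2_standard"))` prints the three commands without running them, which makes it easy to check paths before committing to a long alignment; pass `dry_run=False` once fastp, bowtie2, and kraken2 are installed and on the PATH.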
