Bioinformatics Software Developer Tools / Infrastructure Ideas

This page summarizes the projects mentioned and recommended in the original post on /r/bioinformatics.

  • htscodecs

    Custom compression for CRAM and others.

  • If I were pitching this at more of a graduate level, I'd say write only the container format and call the htscodecs reference implementations for the compression. That applies just to the custom codecs; the regular GZIP, LZMA and BZ2 algorithms will already have implementations in most languages. That would probably be a master's thesis level project (about 6 months) with proper supervision.

  • Plausible Analytics

    Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.

  • Also, I don't think software development tools are perfect; they are constantly evolving, with new market players entering every year. Amazon MTurk sucks, which is why there are new data labeling companies. Amazon Redshift was also kind of painful to use, which is why Snowflake got so popular. I also think there are people who don't like to use Google Analytics, so services like https://plausible.io exist. Google's and Apple's in-app purchase systems suck, so RevenueCat came out. My most recent favorite is https://www.ray.io, which tackles scaling distributed ML (deploying prototyped PyTorch code in a production environment is painful and takes a lot of effort).

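
The htscodecs comment above suggests writing only the container layer yourself, relying on stock implementations for the regular codecs and delegating the custom ones to htscodecs. A minimal sketch of that split follows. The method labels, the `decode_block` function and the `custom_decoders` parameter are hypothetical names chosen for illustration; they are not part of htscodecs or of the CRAM specification, and no real htscodecs binding is called here.

```python
import bz2
import gzip
import lzma

# Codecs that ship with the Python standard library.
# The string labels are illustrative, not identifiers from any specification.
STANDARD_DECODERS = {
    "raw": lambda data: data,
    "gzip": gzip.decompress,
    "bzip2": bz2.decompress,
    "lzma": lzma.decompress,
}


def decode_block(method, payload, custom_decoders=None):
    """Decompress one block of a hypothetical container format.

    Standard codecs are handled locally. Anything else is looked up in
    `custom_decoders`, a dict the caller fills with wrappers around an
    external reference implementation (assumed to exist elsewhere).
    """
    custom_decoders = custom_decoders or {}
    if method in STANDARD_DECODERS:
        return STANDARD_DECODERS[method](payload)
    if method in custom_decoders:
        return custom_decoders[method](payload)
    raise NotImplementedError(f"no decoder registered for {method!r}")
```

For example, `decode_block("gzip", gzip.compress(b"ACGT"))` returns the original bytes, while an unregistered label raises `NotImplementedError`. Keeping the custom codecs behind a plain dictionary means the container code never needs to know how they are implemented.
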
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
