-
perl-for-reysenbach-lab
These are perl scripts I developed over many years as a Bioinformaticist for the Reysenbach Lab at PSU. The Reysenbach Lab studies microbial diversity in extreme environments. Lotta fasta utilities here if you are into that sort of thing.
-
python-bioinformatics
Here are five of my bioinformatics scripts that have been converted from Perl to Python... Also a script for creating histograms of GC content.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
In 2016 I wrote a Perl script for the Reysenbach lab that calculates k-mer frequencies, for all k-mers in a given sequence. It was made public soon after, and you can find get_kmer_frequencies.pl on my Github page.
This script is available for download on my GitHub page. To test for speed, I ran these two scripts five times each with two different sequence examples; a full genome file containing one long fasta record, and a Metagenome Assembled Genome (MAG) file containing 1708 shorter fasta records. Python was 2.9 times faster than Perl for the full genome, and 2.1 times faster than Perl for the MAG. I’ve recently become aware of the optimizing compiler RPerl, which allows Perl to be compiled into “ultra-fast and fully-compatible” C++, so I’m excited to try this out and see how the results compare to both Perl and Python.
Related posts
-
Pgdsat – Postgres database security assessment tool for CIS benchmarks
-
I don't want to fill out your contact form
-
Desert greening is transforming dust and sand into farmable soil (2023)
-
87.7% of entrepreneurs struggle with at least one mental health issue
-
Alice's Adventures in a Differentiable Wonderland