biofast
nimtorch
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
biofast
-
Parsing huge files in Python
FYI: the python packages I mentioned earlier can all directly read gzip'd fastq files. See also this repo for examples.
-
Does Rust Support Reading in FATSA files?
needletail is rated in the Heng Li benchmark (https://github.com/lh3/biofast/)
- Why I Use Nim instead of Python for Data Processing
nimtorch
- NimTorch: PyTorch front end by generating C++ native ATen code
- Nim Front End for Pytirch
-
Why I Use Nim instead of Python for Data Processing
There’s been a tremendous amount of work optimizing blas _and_ ensuring it’s numerically stable. Julia made a good choice to use blas first. Though it’s good to see new native implementations.
For Nim, there’s also NimTorch which is interesting in that it builds on Nim’s C++ target to generate native PyTorch code. Even Python is technically a second class citizen for the C++ code. Most ML libraries are C++ all the way down.
https://github.com/sinkingsugar/nimtorch
What are some alternatives?
scikit-bio - scikit-bio: a community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.
Arraymancer - A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
readfq - Fast multi-line FASTA/Q reader in several programming languages
PrimesResult - The results of the Dave Plummer's Primes Drag Race
viroiddb - A curated database of all available viroid-like RNA sequences
RecursiveFactorization.jl
nimpy - Nim - Python bridge
benchmarks - Some benchmarks of different languages
nimpylib - Some python standard library functions ported to Nim