Top 17 Biology Open-Source Projects
Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and Biology
Project mention: How do I transition into bioinformatics from a senior software engineer (14 years of experience)? | reddit.com/r/bioinformatics | 2021-05-23
Project Home repo for Central Dockerfile and Project-wide issues
Project mention: I just learned about the OpenWorm project. Does this have any implications for the philosophy of consciousness? | reddit.com/r/askphilosophy | 2021-06-03
The main repository for the development of the evolution game Thrive.
Project mention: A few questions about evolution. | reddit.com/r/evolution | 2021-07-19
There is a game called Thrive that is trying to make a more complex Spore-style simulation. It's free-to-play but currently only has the first, microbial stage. https://revolutionarygamesstudio.com/
Scripts to download genomes from the NCBI FTP servers
Project mention: Downloading genomes from database via command line FTP | reddit.com/r/bioinformatics | 2021-07-16
I know you said Ensembl, but if you can live with NCBI, I would suggest https://github.com/kblin/ncbi-genome-download
C++ library and cmdline tools for parsing and manipulating VCF files
SeqAn's official repository.
Chemical reaction network and systems biology interface for scientific machine learning (SciML). High performance, GPU-parallelized, and O(1) solvers in open source software
Project mention: Should I switch over completely to Julia from Python for numerical analysis/computing? | reddit.com/r/Julia | 2021-07-08
ModelingToolkit.jl adds a different spin on this by noting that what makes a good modeling system isn't top down but a system that allows for bottom-up contributions. ModelingToolkit is built on Symbolics.jl which uses OSCAR.jl etc., so every time the symbolics community gets better ModelingToolkit.jl gets better. It connects to the whole SciML ecosystem, so any improvement to any of the SciML interface packages is directly an improvement to ModelingToolkit.jl. ModelingToolkit is made to be a set of composable compiler abstractions called transformations, so anyone can add new packages that do new transformations that improve the ecosystem. One that I really like is MomentClosure.jl, which symbolically transforms stochastic ModelingToolkit models (ReactionSystem) to approximate symbolic ODESystem models of the moments. And there are domain-specific languages like Catalyst.jl being built on the interface to give more ways to build models, which is spurring the biocommunity to make model importers into the symbolic forms, which then feeds more ODE models into the same compiler. JuliaSim is then building on this ecosystem, adding cloud infrastructure that is special-purpose made for doing parallel computations of these models, automatic symbolic model discovery from data, automatic generation of approximate models with machine learning, and tying the Julia Computing compiler team into the web that is building this ecosystem.
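As a rough illustration of what turning a reaction network into an ODE model means, here is a minimal sketch in plain Python. It does not use Catalyst.jl, ModelingToolkit.jl, or any SciML API; the function names `mass_action_rhs` and `euler`, the reaction format, and the rate constants are all hypothetical, and the fixed-step Euler integrator stands in for a real solver.

```python
def mass_action_rhs(reactions, state):
    """Return d(state)/dt under mass-action kinetics.

    Each reaction is (rate_constant, reactants, products), where reactants
    and products map a species name to its stoichiometric coefficient.
    This format is an assumption made for this sketch.
    """
    derivs = {species: 0.0 for species in state}
    for k, reactants, products in reactions:
        # Mass-action rate: k times each reactant concentration to its coefficient.
        rate = k
        for species, coeff in reactants.items():
            rate *= state[species] ** coeff
        # Reactants are consumed, products are produced.
        for species, coeff in reactants.items():
            derivs[species] -= coeff * rate
        for species, coeff in products.items():
            derivs[species] += coeff * rate
    return derivs


def euler(reactions, state, dt, steps):
    """Integrate the ODEs with fixed-step forward Euler (illustration only)."""
    state = dict(state)
    for _ in range(steps):
        derivs = mass_action_rhs(reactions, state)
        for species in state:
            state[species] += dt * derivs[species]
    return state


# Hypothetical reversible reaction A <-> B with forward rate 2.0 and reverse rate 1.0.
reactions = [
    (2.0, {"A": 1}, {"B": 1}),
    (1.0, {"B": 1}, {"A": 1}),
]
final = euler(reactions, {"A": 1.0, "B": 0.0}, dt=0.001, steps=10000)
```

With these made-up constants the system relaxes toward the equilibrium where the forward and reverse rates balance, with A near 1/3 and B near 2/3. A symbolic system such as the one described above would generate the equations once and hand them to an optimized solver rather than looping in Python.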
A Python Library for Conway's Game of Life
Project mention: Which language do you use to code cellular automata? | reddit.com/r/cellular_automata | 2021-05-24
Python! I even made a small library to do it: https://github.com/ljvmiranda921/seagull
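For readers new to cellular automata, a single generation of Conway's Game of Life fits in a few lines of standard-library Python. This is a minimal sketch and not the seagull API; the `step` function and the set-of-coordinates board representation are assumptions made for illustration.

```python
from collections import Counter


def step(live):
    """Advance a set of live (row, col) cells by one generation."""
    # Count how many live neighbours each cell on or next to the pattern has.
    neighbour_counts = Counter(
        (r + dr, c + dc)
        for r, c in live
        for dr in (-1, 0, 1)
        for dc in (-1, 0, 1)
        if (dr, dc) != (0, 0)
    )
    # A cell is alive next generation if it has exactly 3 live neighbours,
    # or if it has exactly 2 and is already alive.
    return {
        cell
        for cell, n in neighbour_counts.items()
        if n == 3 or (n == 2 and cell in live)
    }


# A horizontal "blinker" flips to vertical and back every generation.
blinker = {(1, 0), (1, 1), (1, 2)}
after_one = step(blinker)
after_two = step(after_one)
```

Storing only the live cells keeps the board unbounded and makes sparse patterns cheap. A grid-based library would more likely use arrays, trading that flexibility for speed on dense boards.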
A Go package for engineering organisms.
Project mention: Ask HN: What's an interesting DIY genetic engineering project? | news.ycombinator.com | 2021-06-11
I have experience in DIY genetic engineering (have run a DIY home genetic engineering lab for almost 10 years now https://www.nytimes.com/2018/05/14/science/biohackers-gene-e... )
What you can do and what you can do are different things. Genetic engineering and biological manipulation go as deep as software, and tacit knowledge about execution is non-trivial to the point where you WILL mess up experiments (so expect to repeat a lot).
That said, you can still do some fun stuff. I would recommend trying to do something very small but actually novel. For example, if you've done a GFP transformation into E. coli, try to get the GFP transformation working in a new organism (maybe a yogurt bacterium). Keep it small though, and keep it single-celled, or else you are putting yourself into the pit of despair.
Also check out the Poly project (https://github.com/TimothyStiles/poly). We're basically building (decent) open-source software for doing synthetic biology. Since you're a software developer, doing code reviews and reading our mega-comments (like https://github.com/TimothyStiles/poly/blob/prime/transformat...) might help you understand some more of the fundamental engineering problems we synthetic biologists are encountering. Also, in code reviews, if you don't understand something, a practicing synthetic biologist will explain it to you so that we can improve our docs.
Data structures for double stranded DNA & simulation of homologous recombination, Gibson assembly, cut & paste cloning in Python and Jupyter notebooks.
Project mention: Most optimal programming language for the field of genetics? | reddit.com/r/biotech | 2021-02-25
Biological sequences for the Julia language
Project mention: Learning which programming language will make me the most accessible in bioinformatics community? (if there's any) | reddit.com/r/bioinformatics | 2021-04-20
Structural variant detection and association testing
BioAmp EXG Pill is a small and elegant Analog Front End (AFE) board for BioPotential signal acquisition.
Project mention: Anyone know of any very cheap / DIY EEG contraptions? | reddit.com/r/compmathneuro | 2021-03-22
libsequence: a C++ class library for evolutionary genetic analysis
The Aquarium Lab Operating System
Project mention: New Study Explains How to Engineer the Coronavirus + All Other Synthetic Biology Research This Week | reddit.com/r/biotech | 2021-02-01
From the methods section of the Aquarium paper: Aquarium is distributed under the open-source MIT license. Aquarium, documentation, and installation instructions are freely available (https://www.aquarium.bio/) along with links to Dockerized versions of the software. Code is maintained on Github (https://github.com/aquariumbio/aquarium). Aquarium’s Python API (Trident) is also under the open-source MIT license and is hosted on the open-source python repository at PyPI (https://pypi.org/project/pydent/) and its documentation and installation instructions are also freely available (https://aquariumbio.github.io/trident/).
Cellular Automata Simulator
Project mention: Where can I learn how to use Golly? | reddit.com/r/cellular_automata | 2021-04-14
An open-access bioinformatics text
Project mention: Beginner's bioinformatics books for someone without any knowledge in biology? | reddit.com/r/bioinformatics | 2021-05-13
A former advisor of mine wrote this up a bit ago and I felt it was solid (also a CS major but did some bio work and his text was written with that in mind). https://github.com/zaneveld/full_spectrum_bioinformatics
What are some of the best open-source Biology projects? This list will help you: