edlib
Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance. (by Martinsos)
seq
A high-performance, Pythonic language for bioinformatics (by seq-lang)
edlib | seq | |
---|---|---|
2 | 15 | |
484 | 634 | |
- | - | |
1.1 | 0.7 | |
about 1 year ago | over 1 year ago | |
C++ | C++ | |
MIT License | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
edlib
Posts with mentions or reviews of edlib.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-01-03.
-
What's an efficient way to find multiple subsequences in several FASTQs?
I’ve got a similar situation. I was implementing the Smith-Waterman algorithm when I figured someone had to have already written a “fast” version of this. I found the edlib package (https://github.com/Martinsos/edlib) which does sequence alignment using Levenshtein distance. Essentially same DP algorithm as your traditional NW or SW only this is a C++ implementation with a Python wrapper. (I’m assuming you’re using Python, could be wrong though). The pertinent aspects of the output of this function contains the distance (dissimilarity) and the location (what index does the alignment start and end). This tool may go a ways to helping your pipeline. You could also look to metagenomic papers for inspiration as this is a problem (find a substring in a huge amount of data) that the community contends with all the time. Kmer based approach may also be useful if you want to attempt the alignment free path. Cheers.
-
ModuleNotFoundError after running `pip install -e .` locally
I appear to get that error with the original source as well. https://github.com/Martinsos/edlib
seq
Posts with mentions or reviews of seq.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-09-20.
- Bioinformatics programming language
-
A Python-based programming language for high-performance computational genomics
> Seq is a Python-compatible language, and the vast majority of Python programs should work without any modifications
https://github.com/seq-lang/seq
- Seq – A programming language for computational genomics and bioinformatics
-
Hacker News top posts: Sep 15, 2021
Seq: A programming language for high-performance computational genomics\ (17 comments)
-
Seq: A programming language for high-performance computational genomics
They support both, and will deprecate Python 2 style soon.
https://github.com/seq-lang/seq/issues/223
What are some alternatives?
When comparing edlib and seq you can also consider the following projects:
bwa-mem2 - The next version of bwa-mem
Nim - Nim is a statically typed compiled systems programming language. It combines successful concepts from mature languages like Python, Ada and Modula. Its design focuses on efficiency, expressiveness, and elegance (in that order of priority).