diamond vs edlib

diamond

Accelerated BLAST compatible local sequence aligner. (by bbuchfink)

edlib

Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance. (by Martinsos)

sequence-alignment edit-distance levehnstein-distance Library C++ alignment-path Python Bioinformatics

Source Code

martinsos.github.io

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

diamond		edlib
	Project
3	Mentions	2
975	Stars	484
-	Growth	-
6.3	Activity	1.1
4 months ago	Latest Commit	about 1 year ago
C++	Language	C++
GNU General Public License v3.0 only	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

diamond

Posts with mentions or reviews of diamond. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-12.

comparing the similarity between a set of protein sequences
2 projects | /r/genomics | 12 May 2023

Diamond (https://github.com/bbuchfink/diamond) might help. It has a protein sequence clustering option. You could cluster your sequences and then take the centroids of each cluster. Vary the BLAST parameters to increase/decrease the numbers of clusters.
which database is best to use on BLAST to identify an unknown protein?
1 project | /r/bioinformatics | 6 Nov 2022

What I usually do is the DIAMOND search (https://github.com/bbuchfink/diamond) on UniRef (50/90) database.
someone remotely helped me to download and execute this file called a diamond.exe from the following link: https://github.com/bbuchfink/diamond Windows said it could be unsafe so I disabled Windows Defender and pressed run on it but it didn't do anything, is this a virus is it safe?
1 project | /r/genetics | 27 Jun 2022

someone remotely helped me to download and execute this file called a diamond.exe from the following link: https://github.com/bbuchfink/diamond Windows said it could be unsafe so I disabled Windows Defender and pressed run on it but it didn't do anything, is this a virus is it safe?

edlib

Posts with mentions or reviews of edlib. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-01-03.

What's an efficient way to find multiple subsequences in several FASTQs?
1 project | /r/bioinformatics | 8 Feb 2022

I’ve got a similar situation. I was implementing the Smith-Waterman algorithm when I figured someone had to have already written a “fast” version of this. I found the edlib package (https://github.com/Martinsos/edlib) which does sequence alignment using Levenshtein distance. Essentially same DP algorithm as your traditional NW or SW only this is a C++ implementation with a Python wrapper. (I’m assuming you’re using Python, could be wrong though). The pertinent aspects of the output of this function contains the distance (dissimilarity) and the location (what index does the alignment start and end). This tool may go a ways to helping your pipeline. You could also look to metagenomic papers for inspiration as this is a problem (find a substring in a huge amount of data) that the community contends with all the time. Kmer based approach may also be useful if you want to attempt the alignment free path. Cheers.
ModuleNotFoundError after running `pip install -e .` locally
2 projects | /r/learnpython | 3 Jan 2022

I appear to get that error with the original source as well. https://github.com/Martinsos/edlib

What are some alternatives?

When comparing diamond and edlib you can also consider the following projects:

Biopython - Official git repository for Biopython (originally converted from CVS)

seq - A high-performance, Pythonic language for bioinformatics

seqan3 - The modern C++ library for sequence analysis. Contains version 3 of the library and API docs.

bwa-mem2 - The next version of bwa-mem

nanopolish - Signal-level algorithms for MinION data

libnitrokey - Communicate with Nitrokey devices in a clean and easy manner

casadi - CasADi is a symbolic framework for numeric optimization implementing automatic differentiation in forward and reverse modes on sparse matrix-valued computational graphs. It supports self-contained C-code generation and interfaces state-of-the-art codes such as SUNDIALS, IPOPT etc. It can be used from C++, Python or Matlab/Octave.

frugally-deep - A lightweight header-only library for using Keras (TensorFlow) models in C++.

edlibtest - Private changes to https://github.com/Martinsos/edlib