diamond
minimap2
diamond | minimap2 | |
---|---|---|
3 | 5 | |
969 | 1,698 | |
- | - | |
6.3 | 7.6 | |
3 months ago | 6 days ago | |
C++ | C | |
GNU General Public License v3.0 only | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
diamond
-
comparing the similarity between a set of protein sequences
Diamond (https://github.com/bbuchfink/diamond) might help. It has a protein sequence clustering option. You could cluster your sequences and then take the centroids of each cluster. Vary the BLAST parameters to increase/decrease the numbers of clusters.
-
which database is best to use on BLAST to identify an unknown protein?
What I usually do is the DIAMOND search (https://github.com/bbuchfink/diamond) on UniRef (50/90) database.
-
someone remotely helped me to download and execute this file called a diamond.exe from the following link: https://github.com/bbuchfink/diamond Windows said it could be unsafe so I disabled Windows Defender and pressed run on it but it didn't do anything, is this a virus is it safe?
someone remotely helped me to download and execute this file called a diamond.exe from the following link: https://github.com/bbuchfink/diamond Windows said it could be unsafe so I disabled Windows Defender and pressed run on it but it didn't do anything, is this a virus is it safe?
minimap2
-
Ask HN: Comment here about whatever you're passionate about at the moment
Interested as well! But the future is not so dark, things like e.g. https://github.com/lh3/minimap2 are a breath of fresh air.
-
BLAST 10,000 genes?
No experience with this maybe try minimap2(https://github.com/lh3/minimap2) if this doesn't work fall back on blast/blat
- Truncating genome fastas to just overlapping regions
-
Alignment of long reads to plasmid and generation of consensus sequence.
You can try minimap2 to align your long reads to your expected plasmid
-
Questions about WGS mapping
It sounds like the mapping wasn't very good, you might want to try minimap2 as it is a newer algorithm.
What are some alternatives?
Biopython - Official git repository for Biopython (originally converted from CVS)
bwa-mem2 - The next version of bwa-mem
seqan3 - The modern C++ library for sequence analysis. Contains version 3 of the library and API docs.
seqtk - Toolkit for processing sequences in FASTA/Q formats
seqstats - Quick summary statistics on fasta/fastq(.gz) files
slivar - genetic variant expressions, annotation, and filtering for great good.
MethylDackel - A (mostly) universal methylation extractor for BS-seq experiments.
bwa-mem2 - The next version of bwa-mem
Klib - A standalone and lightweight C library
Bitgrid - Bitgrid - a new model of computation
biowasm - WebAssembly modules for genomics
miniprot - Align proteins to genomes with splicing and frameshift