sra-tools
bwa
sra-tools | bwa | |
---|---|---|
8 | 3 | |
1,054 | 1,454 | |
1.4% | - | |
9.2 | 2.7 | |
6 days ago | about 1 month ago | |
C | C | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sra-tools
-
Bulk RNA seq analysis.
Note that conda sra-tools does not support ARM (M chips use customized ARM architecture ) yet and the official repo also does not provide pre-compiled version for arm. You should be able to compile it from the source but YMMV. Similarly, STAR does not officially support ARM either but compilation on your own always worths a try.
-
fasterq-dump target disk-limit
According to this, the disk space required should be about 17 times the size of the accession (for both output files and tmp files). As can be seen below, I have >500GB of disk space. Yet, I receive this output:
-
Scientists Are Finding Fungi in Cancerous Tumors
Start with downloading SRA toolkit: https://github.com/ncbi/sra-tools/wiki/02.-Installing-SRA-To...
Find some data of interest: https://www.ncbi.nlm.nih.gov/sra?term=(%22Homo%20sapiens%22[... (This searches SRA for human genome sequences on illumina with fastq files available)
Run fasterq-dump on the SRR (listed as "Runs" in the SRA page of your choice):
-
Systematic way to collect GEO datasets
If you are okay with looking at already processed data, chdck out https://dee2.io/. Otherwise there is https://github.com/ncbi/sra-tools for getting fastq files (a cli tool) or https://github.com/saketkc/pysradb (python)
-
How to extract gene sequences from SRA
Fasterq-dump also doesnt play nice sometimes. By that I mean it causes problems like this https://github.com/ncbi/sra-tools/issues/383
-
How do I enable prefetch? Keep getting "-bash: defaults: command not found"
In this case, you're missing software. Specifically, you're missing sra-tools. Fortunately, this is provided via Homebrew as sratoolkit. After installing that formula, you should be able to use the prefetch command.
-
SRAToolKit for pipeline
Yeah, unfortunately this required key entry is by design for some reason (see more here https://github.com/ncbi/sra-tools/issues/291).
- SRA data from NCBI - moved?
bwa
-
my friend showed me his code, thees are all functions
Check out these sweet macros (from the same repo): https://github.com/lh3/bwa/blob/139f68fc4c3747813783a488aef2adc86626b01b/kbtree.h
- Scientists Are Finding Fungi in Cancerous Tumors
- BurrowsâWheeler Transform
What are some alternatives?
biostar-central - Biostar Q&A
foldseek - Foldseek enables fast and sensitive comparisons of large structure sets.
pysradb - Package for fetching metadata and downloading data from SRA/ENA/GEO
slivar - genetic variant expressions, annotation, and filtering for great good.
HomeBrew - đș The missing package manager for macOS (or Linux)
MethylDackel - A (mostly) universal methylation extractor for BS-seq experiments.
bowtie - An ultrafast memory-efficient short read aligner
spades - SPAdes Genome Assembler
libBigWig - A C library for handling bigWig files
javascript-obfuscator - A powerful obfuscator for JavaScript and Node.js