-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I have developed a pipeline (https://github.com/iquasere/MOSCA) that performs integrated MG and MT analysis, using similar steps to those you have listed. If you provide it only MT, it will assemble the MT using Trinity (MEGAHIT was developed for MG analysis), but it is preferable that you use a reference metagenomics dataset (you can also input it that), so you obtain better contigs, better genes and a better reference for your MT. All this is obtained with MOSCA.
Prediction of coding sequences takes as input the contigs you obtained, and gives you the translated genes. Besides annotating with the KEGG database, you may also want to annotate with more general purpose databases (e.g. UniProt), as these provide more taxonomies and functional information. MOSCA includes UPIMAPI (https://github.com/iquasere/UPIMAPI) and reCOGnizer (https://github.com/iquasere/reCOGnizer), which annotate genes with reference to UniProt and CDD databases using two different methods, providing complementary information. This is the same methodology used by widely popular tools such as eggNOG-mapper and Prokka, but these use other databases.
Prediction of coding sequences takes as input the contigs you obtained, and gives you the translated genes. Besides annotating with the KEGG database, you may also want to annotate with more general purpose databases (e.g. UniProt), as these provide more taxonomies and functional information. MOSCA includes UPIMAPI (https://github.com/iquasere/UPIMAPI) and reCOGnizer (https://github.com/iquasere/reCOGnizer), which annotate genes with reference to UniProt and CDD databases using two different methods, providing complementary information. This is the same methodology used by widely popular tools such as eggNOG-mapper and Prokka, but these use other databases.
Related posts
-
Any programs/packages that will allow me to compare cluster annotations obtained from metagenomic data?
-
Is there a way to achieve a similar COG annotation for Haloferax mediterranei?
-
Is there a way to achieve a similar COG annotation for Haloferax mediterranei?
-
Why do some HMMs come with a separate score table instead of embedding in as cut_ga, cut_tc, or cut_nc?
-
Metadbgwas, my tool ! What could I do better ?