orfipy
reCOGnizer
Our great sponsors
orfipy | reCOGnizer | |
---|---|---|
2 | 9 | |
46 | 24 | |
- | - | |
1.8 | 7.4 | |
over 2 years ago | 4 months ago | |
Python | HTML | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
orfipy
-
Looking for a tool to convert a whole fasta file with CDS sequences to a fasta file with protein sequences.
You can probably use or adapt https://github.com/urmi-21/orfipy. I've never used this but it looks fantastic.
-
Question about ORF finder
You don't need any specific hardware to run most bioinformatics programs. There is a web application of ORF finder and if you want to run large amounts of analyses you probably want to run them in a Linux environment. Look at Windows Subsystem for Linux if you are a Windows user. You may also be interested in new ORF finders like this one: https://github.com/urmi-21/orfipy
reCOGnizer
-
Is there a way to achieve a similar COG annotation for Haloferax mediterranei?
reCOGnizer (https://github.com/iquasere/reCOGnizer) will do that for you. It also outputs the COG categories in a krona plot
-
Why do some HMMs come with a separate score table instead of embedding in as cut_ga, cut_tc, or cut_nc?
What tool are you using for analyzing them? Using something like reCOGnizer (https://github.com/iquasere/reCOGnizer) gives you the bitscores and evalues in a TSV table, next to the matches
-
Metadbgwas, my tool ! What could I do better ?
Finally, a huge help on developing is having a Continuous Integration/Deployment on the tool. CD refers to the conda suggestion above, as your tool would be readily available in a controlled environment. CI could use the GitHub Actions functionality provided by GitHub. You can see an example here
-
Any programs/packages that will allow me to compare cluster annotations obtained from metagenomic data?
You may run MOSCA (https://github.com/iquasere/MOSCA), it performs all major steps of metagenomics analysis. It includes that functional classification you are looking for, since with UPIMAPI (https://github.com/iquasere/UPIMAPI) it annotates with UniProt DB as reference, and obtains information including taxonomy, EC numbers, and even those GOs, and reCOGnizer (https://github.com/iquasere/reCOGnizer), which annotates with CDD DB as reference, and obtains orthologous groups information (COG, Pfam, etc).
-
Metatranscriptomics Workflow Questions?
Prediction of coding sequences takes as input the contigs you obtained, and gives you the translated genes. Besides annotating with the KEGG database, you may also want to annotate with more general purpose databases (e.g. UniProt), as these provide more taxonomies and functional information. MOSCA includes UPIMAPI (https://github.com/iquasere/UPIMAPI) and reCOGnizer (https://github.com/iquasere/reCOGnizer), which annotate genes with reference to UniProt and CDD databases using two different methods, providing complementary information. This is the same methodology used by widely popular tools such as eggNOG-mapper and Prokka, but these use other databases.
-
Problems using Prokka
Install with mamba instead of conda, no more problems. Or use UPIMAPI (https://github.com/iquasere/UPIMAPI) together with reCOGnizer (https://github.com/iquasere/reCOGnizer), since these tools obtain better results when annotating proteins
-
How would you name the program that does cog annotation?
I think reCOGnizer is a nice name
-
Is there any other tool for COG annotation of the bacterial genome than EggNOG mapper?
reCOGnizer (https://github.com/iquasere/reCOGnizer) can annotate with COGs, and the other databases available at CDD. It obtains all information concerning COGs description and categories, and outputs krona plots and TSV tables in formats easy to analyze. There is also mantis (https://github.com/PedroMTQ/mantis), prokka (https://github.com/tseemann/prokka) and DFAST (https://github.com/nigyta/dfast_core), the latter two work on contigs and the first two are for proteins
What are some alternatives?
galaxy - Data intensive science for everyone.
Ory Keto - Open Source (Go) implementation of "Zanzibar: Google's Consistent, Global Authorization System". Ships gRPC, REST APIs, newSQL, and an easy and granular permission language. Supports ACL, RBAC, and other access models.
Biopython - Official git repository for Biopython (originally converted from CVS)
prokka - :zap: :aquarius: Rapid prokaryotic genome annotation
pyrodigal - Cython bindings and Python interface to Prodigal, an ORF finder for genomes and metagenomes. Now with SIMD!
guardian_db - Guardian DB integration for tracking tokens and ensuring logout cannot be replayed.
pyfaidx - Efficient pythonic random access to fasta subsequences
progen - Official release of the ProGen models