DNABERT
nlp-recipes
DNABERT | nlp-recipes | |
---|---|---|
1 | 5 | |
546 | 6,020 | |
- | - | |
3.1 | 0.0 | |
2 months ago | over 1 year ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DNABERT
-
[D] New to DNABERT
If I want to get started, they said it's optional to pre-train (so you can skip to step 3). This is where I got tripped up: "Note that the sequences are in kmer format, so you will need to convert your sequences into that." From what I understand, you need to do this so that all of the sequences are the same length? So kmer=6 means all of the sequences are length 6? Someone suggested that I take the first nucleotide in the promoter and grab 3 nucleotides before and 3 nucleotides after (+/-3 bases). I don't think that's how the kmer thing works though? I tried replicating how I think it works down below (I got confused on the last row of the 'after' df). Please correct me if I'm wrong!
nlp-recipes
-
Show HN: I turned my microeconomics textbook into a chatbot with GPT-3
https://github.com/topics/automatic-summarization
Microsoft/nlp-recipes lists current NLP tasks that would be helpful for a docs bot: https://github.com/microsoft/nlp-recipes#content
-
Show HN: DocsGPT, open-source documentation assistant, fully aware of libraries
https://github.com/topics/automatic-summarization
Though now archived,
> Microsoft/nlp-recipes lists current NLP tasks that would be helpful for a docs bot: https://github.com/microsoft/nlp-recipes#content
NLP Tasks: Text Classification, Named Entity Recognition, Text Summarization, Entailment, Question Answering, Sentence Similarity, Embeddings, Sentiment Analysis, Model Explainability, and Auto-Annotatiom
- ✨ 5 Free Resources for Learning Natural Language Processing with Python 🚀
-
Is there any utility software/bot that produces descriptor tags for a Reddit image post using the comments?
I found this (https://github.com/microsoft/nlp-recipes) resource and it has a list of pre-built or easily customizable NLP models that I'm going to try out.
-
Building a Aspect based sentiment classification
There is an NLP recipe from Microsoft on ABSA. Have you seen this? https://github.com/microsoft/nlp-recipes/blob/master/examples/sentiment_analysis/absa/absa.ipynb
What are some alternatives?
courses - This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
OpenPrompt - An Open-Source Framework for Prompt-Learning.
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
rasa - 💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
stanford-tensorflow-tutorials - This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
deepsegment - A sentence segmenter that actually works!
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
pymarl2 - Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
bioconvert - Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
Parrot_Paraphraser - A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.