DNABERT
courses
DNABERT | courses | |
---|---|---|
1 | 7 | |
546 | 4,573 | |
- | - | |
3.1 | 5.4 | |
2 months ago | 22 days ago | |
Python | Python | |
Apache License 2.0 | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DNABERT
-
[D] New to DNABERT
If I want to get started, they said it's optional to pre-train (so you can skip to step 3). This is where I got tripped up: "Note that the sequences are in kmer format, so you will need to convert your sequences into that." From what I understand, you need to do this so that all of the sequences are the same length? So kmer=6 means all of the sequences are length 6? Someone suggested that I take the first nucleotide in the promoter and grab 3 nucleotides before and 3 nucleotides after (+/-3 bases). I don't think that's how the kmer thing works though? I tried replicating how I think it works down below (I got confused on the last row of the 'after' df). Please correct me if I'm wrong!
courses
-
If you are looking for free courses about AI, LLMs, CV, or NLP, I created the repository with links to resources that I found super high quality and helpful. The link is in the comment.
I found it: https://github.com/SkalskiP/courses
https://github.com/SkalskiP/courses/discussions/20. Wouldhttps://github.com/SkalskiP/courses/discussions/20 that format be helpful?
- Repo con cursos gratis sobre IA (en inglés)
- GitHub - SkalskiP/courses: This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
-
Anyone know of any good video lectures for Computer Vision? From great professors at well-regarded universities
This is a popular repo with cv specific resources https://github.com/SkalskiP/courses
- Curated collection of high quality ai resources
-
If you are looking for courses about Artificial Intelligence, I created the repository with links to resources that I found super high quality and helpful. The link is in the comment.
link: https://github.com/SkalskiP/courses
What are some alternatives?
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
course - The Hugging Face course on Transformers
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
WhereIsAI - AI company, product, and tool collection.
stanford-tensorflow-tutorials - This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
vectory - Vectory provides a collection of tools to track and compare embedding versions.
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
bearid - Hypraptive BearID project. FaceNet for bears.
nlp-recipes - Natural Language Processing Best Practices & Examples
Mask3D - Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
bioconvert - Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
wit - WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.