DNABERT
stanford-tensorflow-tutorials
DNABERT | stanford-tensorflow-tutorials | |
---|---|---|
1 | 2 | |
546 | 9,845 | |
- | - | |
3.1 | 0.0 | |
2 months ago | over 3 years ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DNABERT
-
[D] New to DNABERT
If I want to get started, they said it's optional to pre-train (so you can skip to step 3). This is where I got tripped up: "Note that the sequences are in kmer format, so you will need to convert your sequences into that." From what I understand, you need to do this so that all of the sequences are the same length? So kmer=6 means all of the sequences are length 6? Someone suggested that I take the first nucleotide in the promoter and grab 3 nucleotides before and 3 nucleotides after (+/-3 bases). I don't think that's how the kmer thing works though? I tried replicating how I think it works down below (I got confused on the last row of the 'after' df). Please correct me if I'm wrong!
stanford-tensorflow-tutorials
-
Rim Dillon
I’m not sure how to tell you this in a way that won’t deflate your outrage boner, but Stanford uses master in code: https://github.com/chiphuyen/stanford-tensorflow-tutorials
- [D] I'm trying to do more stuff in pure Tensorflow. Is there an in-depth book that explain constructing recurrent, convolutional, graph etc layers in it?
What are some alternatives?
courses - This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
Stanza - Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
stanford-openie-python - Stanford Open Information Extraction made simple!
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
DeepDanbooru - AI based multi-label girl image classification system, implemented by using TensorFlow.
leetcode-compensation - Compensation analysis of leetcode.com/discuss/compensation.
nlp-recipes - Natural Language Processing Best Practices & Examples
tf-encrypted - A Framework for Encrypted Machine Learning in TensorFlow
bioconvert - Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
ChatterBot - ChatterBot is a machine learning, conversational dialog engine for creating chat bots