tape
text
Metric | tape | text
---|---|---
Mentions | 1 | 2
Stars | 620 | 3,439
Growth | 2.4% | 0.6%
Activity | 0.0 | 6.9
Last commit | over 1 year ago | 6 days ago
Language | Python | Python
License | BSD 3-clause "New" or "Revised" License | BSD 3-clause "New" or "Revised" License
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
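The page does not say how the activity number is computed, so the following is only a minimal sketch of one way a score with these properties could be built. It assumes an exponential decay for commit age (the `half_life_days` parameter is hypothetical) and a percentile rank scaled to 0-10; the function names and the sample data are illustrative, not the site's actual method.

```python
from bisect import bisect_left


def weighted_commits(commit_ages_days, half_life_days=30.0):
    """Sum commits so that recent ones count more than older ones.

    Assumption: each commit's weight halves every `half_life_days` days.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(value, all_values):
    """Map a project's weighted-commit value to a 0-10 percentile score."""
    ranked = sorted(all_values)
    # Count tracked projects with a strictly lower value.
    below = bisect_left(ranked, value)
    return round(10 * below / len(ranked), 1)


# Hypothetical weighted-commit totals for ten tracked projects.
tracked = [0, 1, 2, 3, 5, 8, 13, 21, 34, 55]

print(activity_score(55, tracked))  # 9 of 10 projects rank lower
print(activity_score(0, tracked))   # no project ranks lower
```

Under these assumptions, a project that outranks 90% of the tracked projects scores 9.0, which matches the "top 10%" reading given above, and a project with no recent commits scores 0.0.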
tape
- ProteinBERT: A universal deep-learning model of protein sequence and function
  We evaluated based on downstream tasks (multiple supervised benchmarks, including 4 from TAPE), not the LM performance.
text
- torchtext load csv file of strings and tokenize
  I also checked the github repo: https://github.com/pytorch/text
- Tutorials/walkthroughs of torchtext 0.9 anywhere?
  You can find the migration tutorial here https://github.com/pytorch/text/blob/master/examples/legacy_tutorial/migration_tutorial.ipynb
What are some alternatives?
protein-bert-pytorch - Implementation of ProteinBERT in Pytorch
SFDX-Data-Move-Utility - SFDMU is a cutting-edge Salesforce data migration tool for seamless org population from other orgs or CSV files. It handles all CRUD operations on multiple related objects in one go.
fashion-mnist - A MNIST-like fashion product database. Benchmark 👇
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
beir - A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Activeloop Hub - Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake]
ProFET - ProFET: Protein Feature Engineering Toolkit for Machine Learning
Chart2Text - Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model
evodiff - Generation of protein sequences and evolutionary alignments via discrete diffusion models
sru - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
pypdb - A Python API for the RCSB Protein Data Bank (PDB)
Words Counted - A Ruby natural language processor.