https://github.com/huggingface/transformers/tree/master/exam...
If you are interested in more details about the design considerations involved in setting up a large dataset, building efficient tokenizers, and choosing an architecture, have a look at the CodeParrot chapter in the upcoming book on Transformers and NLP: https://learning.oreilly.com/library/view/natural-language-p...