Top 6 Tokenization Open-Source Projects
Ravencoin Core integration/staging treeProject mention: What is Ravencoin? (alert: this is not a hype-post) | reddit.com/r/Ravencoin | 2021-05-07
Most of the recent commits are for the ravencoin.org website while latest commit to ravencoin repository is 17-Jan. Some works for RVN core is being done on PRs but half (3/6) are about GUI & document. Also, roadmap doesn't have ETA (https://github.com/RavenProject/Ravencoin/blob/master/roadmap/README.md).
Secure storage for personal records built to comply with GDPRProject mention: Hottest Israel Startup with Open-Source Spirit | reddit.com/r/Israel | 2021-04-08
Check out the project website for additional information: https://databunker.org/
Scout APM - Leading-edge performance monitoring starting at $39/month. Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language ProcessingProject mention: Trankit v1.0.0 - An open-source Transformer-based Multilingual NLP Toolkit for 56 languages is out. | reddit.com/r/LanguageTechnology | 2021-03-31
Trankit is written in Python and can be easily installed via pip. Our code and pretrained models are publicly available at: https://github.com/nlp-uoregon/trankit
TokenScript schema, specs and paperProject mention: Daily General Discussion - March 9, 2021 | reddit.com/r/ethfinance | 2021-03-09
FPE - Format Preserving Encryption with FF3 in PythonProject mention: Release of format preserving encryption in Python | news.ycombinator.com | 2021-03-15
English lite language model for wink-nlp.Project mention: How to tokenize a string? | dev.to | 2021-02-09