pythainlp
uni2db
pythainlp | uni2db | |
---|---|---|
2 | 1 | |
926 | 5 | |
0.1% | - | |
9.5 | 7.0 | |
4 days ago | 7 months ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pythainlp
-
PyThaiNLP 2.4.0-dev0
Read more about PyThaiNLP v2.4.0-dev0: https://github.com/PyThaiNLP/pythainlp/releases/tag/v2.4.0-dev0
-
Thai word tokenizers benchmark: nlpo3 vs newmm
Thanathip Suntorntip Gorlph ported Korakot Chaovavanich's Thai word tokenizer - Newmm, written in Python, to Rust called nlpo3. The nlpo3 website claimed that nlpo3 is 2X faster than Newmm. I felt that Nlpo3 must be faster than this claim because in contrast to Python's Regex engine, Rust's regex runs in the linear time since it was constrained not to support looking back/ahead. Moreover, 2X faster is ambiguous.
uni2db
What are some alternatives?
nlpo3 - Thai Natural Language Processing library in Rust, with Python and Node bindings.
New-Grad-2024 - π Hey there new gradπ! We've put together a collection of full-time job openings for SWE, Quant, PM and tech roles in 2024! π
PyProjects - Beginner Friendly Python-Projects
Summer-2024-SWE-Internships - A list of Summer 2024 internships for software engineering, updated automatically everyday
toiro - A comparison tool of Japanese tokenizers
TikTokBot - A TikTokBot that downloads trending tiktok videos and compiles them using FFmpeg
google-play-scraper - Google play scraper for Python inspired by <facundoolano/google-play-scraper>
Dorkify - Perform Google Dork search with Dorkify
abydos - Abydos NLP/IR library for Python
echo360 - Commandline tool for automated downloads of echo360 videos hosted by university
transformers - π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Summer2023-Internships - Collection of Summer 2023 & Summer 2024 tech internships! [Moved to: https://github.com/pittcsc/Summer2024-Internships]