SaaSHub helps you find the best software and product alternatives Learn more โ
Top 7 Rust Natural Language Processing Projects
-
lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
nlprule
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Hugging Face seems to like Rust. They also wrote Tokenizers in Rust.
Project mention: I created a program that finds out which anki cards out of 50_000 are in english and deletes them in 2 minutes | /r/rust | 2023-10-23Discovery of Lingua: While working on a different project, I discovered the Lingua library.
Another interesting projects in the space:
- nlprule: https://github.com/bminixhofer/nlprule
- prosemd: https://github.com/kitten/prosemd-lsp
- cargo spellcheck: https://github.com/drahnr/cargo-spellcheck
Project mention: Show HN: Quickwit โ Cost-efficient Elasticsearch alternative on object storage | news.ycombinator.com | 2023-06-07- Another nice comment seen on HN ยซ it seems to be very easy to run, not very IO intensive, and running fine on a single node with modest hardware with >2 billion log rows. It has a really cool dynamic schema feature too.ยป [9]
Fun fact: at least 4 users are using Garage[10] as the object storage, this OSS project looks really promising and made the HN front page a few months ago[11], we really cherish the OSS for this kind of unexpected combination.
Any feedback positive/negative always greatly appreciated here!
[0] Quickwit repo: https://github.com/quickwit-oss/quickwit
[1] Searching the web under 1000$/month: https://news.ycombinator.com/item?id=27074481
[2] Chitchat gossip library: https://github.com/quickwit-oss/chitchat
[3] Columnar format: https://github.com/quickwit-oss/tantivy/tree/main/columnar
[4] Tantivy library: https://github.com/quickwit-oss/tantivy/
[5] Whichlang library: https://github.com/quickwit-oss/whichlang
[6] GitHub Archive demo in terminal: https://www.youtube.com/watch?v=SNq3bARRlDI
[7] Indexing performance: https://twitter.com/fulmicoton/status/1638016949459488768
[8] https://twitter.com/arnonrgo/status/1645429632303235073?s=20
[9] https://news.ycombinator.com/item?id=35742544
[10] Garage object storage: https://garagehq.deuxfleurs.fr/
[11] https://news.ycombinator.com/item?id=33853539
Rust Natural Language Processing related posts
- HF Transfer: Speed up file transfers
- Whichlang โ Fast, OSS for Language Detection in Rust
- Whichlang โ Fast, OSS for Language Detection in Rust
- LLM custom dictionary
- LanguageTool-Rust is releasing 1.0.0!
- What's everyone working on this week (33/2021)?
- Thai word tokenizers benchmark: nlpo3 vs newmm
-
A note from our sponsor - SaaSHub
www.saashub.com | 25 Apr 2024
Index
What are some of the best open-source Natural Language Processing projects in Rust? This list will help you:
Project | Stars | |
---|---|---|
1 | tokenizers | 8,395 |
2 | lingua-rs | 817 |
3 | nlprule | 570 |
4 | whichlang | 341 |
5 | instant-segment | 85 |
6 | dpar | 41 |
7 | nlpo3 | 30 |
Sponsored