Top 15 Rust NLP Projects
-
rust-bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
-
lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
-
nlprule
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
-
cargo-spellcheck
Checks all your documentation for spelling and grammar mistakes with hunspell and a nlprule based checker for grammar
Hugging Face seems to like Rust. They also wrote Tokenizers in Rust.
Project mention: How to leverage the state-of-the-art NLP models in Rust | /r/infinilabs | 2023-06-07
    brew install libtorch
    brew link libtorch
    brew ls --verbose libtorch | grep dylib
    export LIBTORCH=$(brew --cellar pytorch)/$(brew info --json pytorch | jq -r '.[0].installed[0].version')
    export LD_LIBRARY_PATH=${LIBTORCH}/lib:$LD_LIBRARY_PATH
    git clone https://github.com/guillaume-be/rust-bert.git
    cd rust-bert
    ORT_STRATEGY=system cargo run --example sentence_embeddings
Project mention: Lingua 1.5.0 - The most accurate natural language detection library for Rust, now with support for detecting multiple languages in mixed-language text | /r/rust | 2023-06-15
How does it compare to whatlang?
Project mention: I created a program that finds out which anki cards out of 50_000 are in english and deletes them in 2 minutes | /r/rust | 2023-10-23
Discovery of Lingua: While working on a different project, I discovered the Lingua library.
Other interesting projects in the space:
- nlprule: https://github.com/bminixhofer/nlprule
- prosemd: https://github.com/kitten/prosemd-lsp
- cargo spellcheck: https://github.com/drahnr/cargo-spellcheck
Our code runs against an extensive test suite of examples from the Kashika Vrtti and the Siddhanta Kaumudi. Are there bugs? Yes, and we know where most of them are due to our test suite (look for the #[ignore] annotation for tests with at least one unsupported word.) Happily, the number of bugs here is decreasing over time.
Project mention: Rust Keyword Extraction: Creating the YAKE! algorithm from scratch | dev.to | 2024-04-27
All the code discussed in this article can be accessed through this repository. For integration with existing projects consider using keyword_extraction crate available on crates.io.
Rust NLP related posts
-
Rust Keyword Extraction: Creating the YAKE! algorithm from scratch
-
A Paninian [Sanskrit] word generator
-
Vale.sh – A Linter for Prose
-
HF Transfer: Speed up file transfers
-
Creating search engine for your local network - Is it even possible?
-
Lingua 1.5.0 - The most accurate natural language detection library for Rust, now with support for detecting multiple languages in mixed-language text
-
Is anyone doing Machine Learning in Rust?
Index
What are some of the best open-source NLP projects in Rust? This list will help you:
# | Project | Stars |
---|---|---|
1 | tokenizers | 8,458 |
2 | rust-bert | 2,427 |
3 | whatlang-rs | 952 |
4 | lingua-rs | 824 |
5 | nlprule | 574 |
6 | cargo-spellcheck | 310 |
7 | txtai.rs | 98 |
8 | vidyut | 44 |
9 | treebender | 39 |
10 | yozuk | 37 |
11 | bytepiece-rs | 14 |
12 | whatlang-pyo3 | 11 |
13 | keyword-extraction-rs | 9 |
14 | semdesk | 1 |
15 | upsc3ne | 1 |