InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 22 Rust Text processing Projects
-
ripgrep
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
Project mention: Zig, Rust, Go?! I tried 3 low-level languages and here’s what I’m sticking with | dev.to | 2025-08-28ripgrep (Rust project)
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
There is sd, not a drop in replacement though.
https://github.com/chmln/sd
-
regex
An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
Project mention: fd: A simple, fast and user-friendly alternative to 'find' | news.ycombinator.com | 2025-03-19Yes. I'm the one who implemented the case folding the `fd` uses (via its regex engine).
See: https://github.com/rust-lang/regex/blob/master/UNICODE.md#rl...
And then Unicode itself for more discussion on the topic: https://unicode.org/reports/tr18/#Simple_Loose_Matches
TR18 used to have a Level 3[1] with the kind of locale-specific custom tailoring support found in GNU's implementation of POSIX locales, but it was so fraught that it was retracted completely some years ago.
[1]: https://unicode.org/reports/tr18/#Tailored_Support
-
-
-
-
Project mention: Oxidizing Ubuntu: adopting Rust utilities by default | news.ycombinator.com | 2025-03-21
And when you put people on a pedestal, they're guaranteed to let you down. :-) https://github.com/BurntSushi/aho-corasick/commit/474393be8d...
I wouldn't be surprised if that style of leak were more prevalent than one would expect. It's pretty subtle. But that link is the only such instance I'm aware of it happening to such a degree in crates I maintain. Maybe there are other instances. This is why I try to use `Box<[T]>` when possible, because you know that can't have extra capacity.
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
-
-
-
-
-
-
-
-
-
-
Matcher
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust. (by Lips7)
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Rust Text processing discussion
Rust Text processing related posts
-
🔥 Build a RAG Chatbot That Talks to Your Documents Using Python (Gemma + Qdrant + Docling)
-
Bzip2 crate switches from C to 100% rust
-
ripgrep: Not Just a Faster grep, but a Sharper One
-
How to combine rg with less in terminal
-
Resolving a mysterious problem with find
-
SD: Intuitive find and replace CLI (sed alternative)
-
Ripgrep Cheatsheet For Neovim Users
-
A note from our sponsor - InfluxDB
www.influxdata.com | 1 Sep 2025
Index
What are some of the best open-source Text processing projects in Rust? This list will help you:
# | Project | Stars |
---|---|---|
1 | ripgrep | 54,892 |
2 | sd | 6,462 |
3 | regex | 3,769 |
4 | pomsky | 1,321 |
5 | Fluent | 1,234 |
6 | nucleo | 1,204 |
7 | aho-corasick | 1,136 |
8 | whatlang-rs | 1,036 |
9 | hck | 723 |
10 | textwrap | 502 |
11 | suffix | 274 |
12 | tabwriter | 264 |
13 | UNIC | 242 |
14 | cpc | 148 |
15 | sliceslice-rs | 97 |
16 | srch | 76 |
17 | nlpo3 | 35 |
18 | ngrams | 27 |
19 | rule72 | 24 |
20 | Matcher | 17 |
21 | untanglr | 14 |
22 | rtr | 1 |