Top 19 Rust Text processing Projects
-
ripgrep
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
-
regex
An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
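The untanglr entry above describes splitting concatenated words by unigram frequency. As a rough illustration of that general idea (not untanglr's actual code or API), the sketch below picks the split with the highest total log-probability using dynamic programming. The function name `segment` and the tiny frequency table are hypothetical; a real tool would load counts from a corpus such as Wikipedia.

```rust
use std::collections::HashMap;

/// Split `text` into the most probable sequence of known words under a
/// unigram model. Returns `None` when no split covers the whole input.
/// Hypothetical helper for illustration only.
fn segment(text: &str, counts: &HashMap<&str, u64>) -> Option<Vec<String>> {
    let total = counts.values().sum::<u64>() as f64;
    let chars: Vec<char> = text.chars().collect();
    let n = chars.len();

    // best[i] = (log-probability, start index of the last word) for the
    // best known split of the first i characters, if any split exists.
    let mut best: Vec<Option<(f64, usize)>> = vec![None; n + 1];
    best[0] = Some((0.0, 0));

    for end in 1..=n {
        for start in 0..end {
            let Some((prev_score, _)) = best[start] else { continue };
            let word: String = chars[start..end].iter().collect();
            if let Some(&count) = counts.get(word.as_str()) {
                let score = prev_score + (count as f64 / total).ln();
                if best[end].map_or(true, |(s, _)| score > s) {
                    best[end] = Some((score, start));
                }
            }
        }
    }

    // Follow the back-pointers from the end to recover the words.
    let mut words = Vec::new();
    let mut end = n;
    while end > 0 {
        let (_, start) = best[end]?;
        words.push(chars[start..end].iter().collect::<String>());
        end = start;
    }
    words.reverse();
    Some(words)
}

fn main() {
    // Made-up counts, standing in for real corpus frequencies.
    let counts: HashMap<&str, u64> =
        HashMap::from([("hello", 50), ("world", 40), ("hell", 10), ("o", 1)]);
    println!("{:?}", segment("helloworld", &counts));
}
```

With these toy counts, "hello" + "world" outscores "hell" + "o" + "world" because every extra low-frequency word lowers the total log-probability.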
ripgrep - https://github.com/BurntSushi/ripgrep
The homepage has a benchmark that compares Zed's "insertion latency" to other editors, and this is the description:
> Open input.rs at the end of line 21 in rust-lang/regex. Type z 10 times, measure how long it takes for each z to display since hitting the z key.
Could someone clarify what that means? My interpretation of that was to go to https://github.com/rust-lang/regex/blob/master/regex-cli/arg... and start typing 'z' at the end of line 21, but that doesn't seem to make any sense. I guess that repo got refactored and those instructions are out of date?
Project mention: Libxo: The Easy Way to Generate Text, XML, JSON, and HTML Output | news.ycombinator.com | 2023-07-14
> Typical printf usage is imperative and additive:
> if (enter) printf("Hello "); else printf("Goodbye "); printf("World!\n");
And unless you want your translator to hate you, you really, really mustn’t do this in user-facing output.
(OK, you can if you really want to and if you’re ready to give them the same tools[1], but it won’t be simple. Although I’m unaware of any professional translators supporting this either—most use a CAT, and the Fluent approach ignores those.)
[1] https://projectfluent.org/
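To make the comment's point concrete, here is a minimal Rust sketch contrasting the piecewise style from the quoted printf example with a lookup of whole messages. It does not use Fluent's API; the function names, message keys, and the `HashMap` catalog are assumptions made up for illustration.

```rust
use std::collections::HashMap;

/// Piecewise construction, mirroring the quoted printf example. The full
/// sentence only exists after concatenation, so a translator never sees
/// it as a unit and cannot reorder or inflect its parts.
fn greet_piecewise(enter: bool) -> String {
    let mut out = String::new();
    out.push_str(if enter { "Hello " } else { "Goodbye " });
    out.push_str("World!");
    out
}

/// Whole-message lookup: each complete sentence has its own key, so a
/// translation can phrase it however the target language requires.
/// Falls back to the key itself when the catalog has no entry.
fn greet_localized(enter: bool, catalog: &HashMap<&str, &str>) -> String {
    // Hypothetical message keys.
    let key = if enter { "greeting-enter" } else { "greeting-leave" };
    catalog.get(key).copied().unwrap_or(key).to_string()
}

fn main() {
    // A hypothetical German catalog; real projects would load this from
    // translation files rather than hard-coding it.
    let de: HashMap<&str, &str> = HashMap::from([
        ("greeting-enter", "Hallo Welt!"),
        ("greeting-leave", "Auf Wiedersehen, Welt!"),
    ]);
    println!("{}", greet_piecewise(true));
    println!("{}", greet_localized(false, &de));
}
```

The second form keeps every user-facing sentence intact in the catalog, which is the property the comment argues translators need.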
Project mention: Lingua 1.5.0 - The most accurate natural language detection library for Rust, now with support for detecting multiple languages in mixed-language text | /r/rust | 2023-06-15
How does it compare to whatlang?
Project mention: An old but good field command for printing tab separated fields from a file to stdout. | /r/commandline | 2023-05-15
Rust Text processing related posts
-
Ripgrep
-
CryptoFlow: Building a secure and scalable system with Axum and SvelteKit - Part 3
-
Text Editor: Data Structures
-
what terminal emulator do you use and why?
-
Command Line Rust is a great book
-
Common Rust Lifetime Misconceptions
-
Advent of Code 2023 is nigh
-
Index
What are some of the best open-source Text processing projects in Rust? This list will help you:
# | Project | Stars |
---|---|---|
1 | ripgrep | 45,156 |
2 | sd | 5,385 |
3 | regex | 3,362 |
4 | pomsky | 1,260 |
5 | Fluent | 1,002 |
6 | aho-corasick | 953 |
7 | whatlang-rs | 952 |
8 | hck | 683 |
9 | textwrap | 428 |
10 | suffix | 253 |
11 | tabwriter | 244 |
12 | UNIC | 234 |
13 | cpc | 114 |
14 | sliceslice-rs | 87 |
15 | srch | 74 |
16 | nlpo3 | 30 |
17 | ngrams | 27 |
18 | untanglr | 14 |
19 | rtr | 1 |