Rust Text processing

Open-source Rust projects categorized as Text processing

Top 19 Rust Text processing Projects

  • ripgrep

    ripgrep recursively searches directories for a regex pattern while respecting your gitignore

  • Project mention: Ask HN: What software sparks joy when using? | news.ycombinator.com | 2024-04-17

    ripgrep - https://github.com/BurntSushi/ripgrep

  • sd

    Intuitive find & replace CLI (sed alternative)

  • Project mention: Essential Command Line Tools for Developers | dev.to | 2024-01-15

  • regex

    An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.

  • Project mention: Zed is now open source | news.ycombinator.com | 2024-01-24

    The homepage has a benchmark that compares Zed's "insertion latency" to other editors, and this is the description:

    > Open input.rs at the end of line 21 in rust-lang/regex. Type z 10 times, measure how long it takes for each z to display since hitting the z key.

    Could someone clarify what that means? My interpretation of that was to go to https://github.com/rust-lang/regex/blob/master/regex-cli/arg... and start typing 'z' at the end of line 21, but that doesn't seem to make any sense. I guess that repo got refactored and those instructions are out of date?

  • pomsky

    A new, portable, regular expression language

  • Fluent

    Rust implementation of Project Fluent

  • Project mention: Libxo: The Easy Way to Generate Text, XML, JSON, and HTML Output | news.ycombinator.com | 2023-07-14

    > Typical printf usage is imperative and additive:

    > if (enter) printf("Hello "); else printf("Goodbye "); printf("World!\n");

    And unless you want your translator to hate you, you really, really mustn’t do this in user-facing output.

    (OK, you can if you really want to and if you’re ready to give them the same tools[1], but it won’t be simple. Although I’m unaware of any professional translators supporting this either—most use a CAT, and the Fluent approach ignores those.)

    [1] https://projectfluent.org/

  • whatlang-rs

    Natural language detection library for Rust. Try demo online: https://whatlang.org/

  • Project mention: Lingua 1.5.0 - The most accurate natural language detection library for Rust, now with support for detecting multiple languages in mixed-language text | /r/rust | 2023-06-15

    How does it compare to whatlang?

  • aho-corasick

    A fast implementation of Aho-Corasick in Rust.

  • Project mention: Aho-Corasick Algorithm | news.ycombinator.com | 2024-03-04
  • hck

    A sharp cut(1) clone.

  • Project mention: An old but good field command for printing tab separated fields from a file to stdout. | /r/commandline | 2023-05-15
  • textwrap

    An efficient and powerful Rust library for word wrapping text.

  • suffix

    Fast suffix arrays for Rust (with Unicode support).

  • tabwriter

    Elastic tabstops for Rust.

  • UNIC

    UNIC: Unicode and Internationalization Crates for Rust

  • cpc

    Text calculator with support for units and conversion

  • sliceslice-rs

    A fast implementation of single-pattern substring search using SIMD acceleration.

  • srch

    Text search for humans

  • nlpo3

    Thai Natural Language Processing library in Rust, with Python and Node bindings.

  • ngrams

    (Read-only) Generate n-grams

  • untanglr

    Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.

  • rtr

    Command-line text processing tool (by 4rtzel)

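The Libxo comment quoted under Fluent above warns against assembling user-facing text from printf fragments. A minimal sketch of the alternative, assuming a hypothetical in-memory catalog keyed by message id: each branch looks up one complete sentence, so a translator always sees the whole message. This uses only the standard library and is not the Fluent crate's API; the names `Catalog`, `build_catalog`, `greeting`, and the message ids are made up for illustration.

```rust
use std::collections::HashMap;

/// Hypothetical catalog type: locale -> (message id -> complete sentence).
type Catalog = HashMap<&'static str, HashMap<&'static str, &'static str>>;

/// Build a tiny illustrative catalog. Every entry is a whole sentence,
/// never a fragment such as "Hello " that gets glued to "World!".
fn build_catalog() -> Catalog {
    let mut catalog = Catalog::new();
    catalog.insert(
        "en",
        HashMap::from([
            ("greet-enter", "Hello World!"),
            ("greet-leave", "Goodbye World!"),
        ]),
    );
    catalog.insert(
        "de",
        HashMap::from([
            ("greet-enter", "Hallo Welt!"),
            ("greet-leave", "Auf Wiedersehen, Welt!"),
        ]),
    );
    catalog
}

/// Pick the message id first, then fetch the full translated sentence.
/// Returns None when the locale or the message id is missing.
fn greeting(catalog: &Catalog, locale: &str, enter: bool) -> Option<&'static str> {
    let id = if enter { "greet-enter" } else { "greet-leave" };
    catalog.get(locale)?.get(id).copied()
}

fn main() {
    let catalog = build_catalog();
    for locale in ["en", "de"] {
        for enter in [true, false] {
            match greeting(&catalog, locale, enter) {
                Some(text) => println!("{locale}: {text}"),
                None => println!("{locale}: <missing translation>"),
            }
        }
    }
}
```

The branch on `enter` selects a message id rather than a word, so word order and punctuation remain the translator's decision for each language.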
NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-17.

Index

What are some of the best open-source Text processing projects in Rust? This list will help you:

 #  Project        Stars
 1  ripgrep        44,747
 2  sd              5,334
 3  regex           3,336
 4  pomsky          1,259
 5  Fluent            983
 6  whatlang-rs       947
 7  aho-corasick      946
 8  hck               679
 9  textwrap          424
10  suffix            253
11  tabwriter         243
12  UNIC              234
13  cpc               113
14  sliceslice-rs      87
15  srch               74
16  nlpo3              30
17  ngrams             27
18  untanglr           14
19  rtr                 1
