string-matching

Open-source projects categorized as string-matching

Top 22 string-matching Open-Source Projects

  • RapidFuzz

    Rapid fuzzy string matching in Python using various string metrics

  • Project mention: RapidFuzz: Rapid fuzzy string matching in Python | news.ycombinator.com | 2024-02-14
  • StringZilla

    Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖

  • Project mention: Measuring energy usage: regular code vs. SIMD code | news.ycombinator.com | 2024-02-19

    The 3.5x energy-efficiency gap between serial and SIMD code becomes even larger when

    A. you do byte-level processing instead of float words;

    B. you use embedded, IoT, and other low-energy devices.

    A few years ago I've compared Nvidia Jetson Xavier (long before the Orin release), Intel-based MacBook Pro with Core i9, and AVX-512 capable CPUs on substring search benchmarks.

    On Xavier one can quite easily disable/enable cores and reconfigure power usage. At peak I got to 4.2 GB/J which was an 8.3x improvement in inefficiency over LibC in substring search operations. The comparison table is still available in the older README: https://github.com/ashvardanian/StringZilla/tree/v2.0.2?tab=...

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • PolyFuzz

    Fuzzy string matching, grouping, and evaluation.

  • Project mention: "We have great datasets" | /r/dataengineering | 2023-06-08
  • go-edlib

    📚 String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

  • closestmatch

    Golang library for fuzzy matching within a set of strings :page_with_curl:

  • strutil-go

    Golang metrics for calculating string similarity and other string utility functions (by adrg)

  • simplematch

    Minimal, super readable string pattern matching for python.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • trrex

    Efficient string matching with regular expressions (by mesejo)

  • LGenerics

    Generic algorithms and data structures for Lazarus/Free Pascal

  • wildmatch

    Simple string matching with single- and multiple-wildcard operator

  • JaroWinkler

    Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity

  • Project mention: Manjaro Package Installation Error | /r/ManjaroLinux | 2023-06-18

    _name=jarowinkler pkgname=python-$_name pkgver=1.2.3 pkgrel=2 pkgdesc='A library for fast approximate string matching using Jaro and Jaro-Winkler similarity' arch=(x86_64) url='https://github.com/maxbachmann/JaroWinkler' license=(MIT) depends=(python) makedepends=(jarowinkler-cpp python-rapidfuzz-capi python-scikit-build)

  • ATGValidator

    iOS validation framework with form validation support

  • multi_string_replace

    A fast multiple string replace library for ruby. Uses a C implementation of the Aho–Corasick Algorithm based on https://github.com/morenice/ahocorasick while adding support for on the fly multiple string replacement. Faster alternative to String.gsub when dealing with non-regex (exact match) use cases

  • boyermoore

    Boyer-moore in pure python, search for unicode strings in large files quickly (by eriknyquist)

  • STS-Crawler

    A python reddit bot for /r/SlayTheSpire that automatically (soft) finds cards and relics mentioned in post titles and comments with descriptions the help new players

  • wordlexcel

    An Excel spreadsheet to help you solve WORDLE puzzles

  • libaca

    C Library of Aho-Corasick Algorithm based on Coordinate Hash Trie

  • Name-QuickSearch

    Find the best fuzzy match for a natural language string in a set of hundreds of thousands of strings in a split second.

  • wildmatch-go

    Simple string matching with questionmark- and star-wildcard operator

  • mscs-thesis-project

    Evaluating Languages for Bioinformatics: Performance, Expressiveness and Energy

  • wordle-solver

    Wordle Game Solver (by lyhsieh)

  • Project mention: Show HN: A fast, accurate and multilingual fuzzy search lib for the front end | news.ycombinator.com | 2024-02-14
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

string-matching related posts

Index

What are some of the best open-source string-matching projects? This list will help you:

Project Stars
1 RapidFuzz 2,348
2 StringZilla 1,776
3 PolyFuzz 716
4 go-edlib 444
5 closestmatch 416
6 strutil-go 276
7 simplematch 173
8 trrex 134
9 LGenerics 102
10 wildmatch 68
11 JaroWinkler 52
12 ATGValidator 51
13 multi_string_replace 21
14 boyermoore 19
15 STS-Crawler 18
16 wordlexcel 13
17 libaca 7
18 Name-QuickSearch 4
19 wildmatch-go 4
20 mscs-thesis-project 3
21 wordle-solver 3
22 fuzzy-search 0

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com