Open-source projects categorized as edit-distance

Top 11 edit-distance Open-Source Projects

  • SymSpell

    SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

    Project mention: Auto correct/Auto complete feature | reddit.com/r/AskComputerScience | 2022-06-27

    If you want to do both at the same time (prefix search, allowing for misspellings), you can use a trie, but rather than just putting all your words in it, you can put everything in the "deletion neighborhood" of each word (that is, each possible variant of each word that has one character deleted), in an approach sort of like what's described here. Fair warning, though, that this gets a little hairy, and you'll have to decide how to weight prefix matches vs. misspellings in your rankings.
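
The "deletion neighborhood" idea in the mention above can be sketched roughly as follows. This is a minimal illustration, not SymSpell's actual code: it uses a plain dictionary instead of a trie, so it covers only the misspelling part and not prefix search, and the function names `single_deletes`, `build_index`, and `lookup` are hypothetical.

```python
from collections import defaultdict


def single_deletes(word):
    """Return the word itself plus every variant with exactly one character removed."""
    variants = {word}
    for i in range(len(word)):
        variants.add(word[:i] + word[i + 1:])
    return variants


def build_index(words):
    """Map each deletion variant back to the dictionary words that produce it."""
    index = defaultdict(set)
    for word in words:
        for variant in single_deletes(word):
            index[variant].add(word)
    return index


def lookup(index, query):
    """Return dictionary words whose deletion neighborhood overlaps the query's."""
    candidates = set()
    for variant in single_deletes(query):
        # .get avoids adding empty entries to the defaultdict during lookups
        candidates |= index.get(variant, set())
    return candidates


index = build_index(["hello", "help", "world"])
print(lookup(index, "helo"))   # candidates sharing a deletion variant with "helo"
print(lookup(index, "wrold"))  # a transposition is caught via shared deletes
```

Because both the dictionary word and the query may each lose a character, a candidate can be up to two edits away from the query, so a real system would likely re-check candidates with a true edit-distance function before ranking them.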

  • PolyFuzz

    Fuzzy string matching, grouping, and evaluation.

  • edlib

    Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.

    Project mention: What's an efficient way to find multiple subsequences in several FASTQs? | reddit.com/r/bioinformatics | 2022-02-08

    I’ve got a similar situation. I was implementing the Smith-Waterman algorithm when I figured someone had to have already written a “fast” version of this. I found the edlib package (https://github.com/Martinsos/edlib), which does sequence alignment using Levenshtein distance. It is essentially the same DP algorithm as your traditional NW or SW, only this is a C++ implementation with a Python wrapper. (I’m assuming you’re using Python, could be wrong though.) The pertinent parts of this function's output are the distance (dissimilarity) and the location (the indices where the alignment starts and ends). This tool may go a long way toward helping your pipeline. You could also look to metagenomic papers for inspiration, as this is a problem (finding a substring in a huge amount of data) that the community contends with all the time. A k-mer-based approach may also be useful if you want to attempt the alignment-free path. Cheers.
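
To make the "distance and location" output described in the mention above concrete, here is a small pure-Python sketch of approximate substring search with the classic edit-distance DP. It is not edlib's implementation or API: the function name `best_infix_match` is hypothetical, and the sketch reports only the end position of the best match, not the start.

```python
def best_infix_match(pattern, text):
    """Find where `pattern` matches inside `text` with the fewest edits.

    Returns (distance, end), where `end` is the exclusive index in `text`
    at which the best match ends (the leftmost such end on ties).
    """
    # prev[i] = fewest edits to align pattern[:i] with some substring of
    # text ending at the previous text position.
    prev = list(range(len(pattern) + 1))  # no text consumed yet: i edits
    best_distance, best_end = prev[-1], 0
    for j, t in enumerate(text, start=1):
        curr = [0]  # an empty pattern prefix may start anywhere at no cost
        for i, p in enumerate(pattern, start=1):
            cost = 0 if p == t else 1
            curr.append(min(prev[i - 1] + cost,  # match or substitution
                            prev[i] + 1,         # extra character in text
                            curr[i - 1] + 1))    # pattern character missing
        if curr[-1] < best_distance:
            best_distance, best_end = curr[-1], j
        prev = curr
    return best_distance, best_end


print(best_infix_match("ACGT", "TTACGTTT"))  # exact hit inside the read
print(best_infix_match("ACGT", "TTACCTTT"))  # one substitution away
```

The only difference from a global alignment is the free start (the `0` in the first cell of every column) and taking the minimum over all end positions, which is what lets the pattern land anywhere in the longer sequence.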

  • js-levenshtein

    The most efficient JS implementation calculating the Levenshtein distance, i.e. the difference between two strings.

    Project mention: Is SQLite available in the browser? | reddit.com/r/learnprogramming | 2022-01-27

    You can write the algorithm from the pseudo code on Wikipedia, or use a library that already implemented it like: https://github.com/gustf/js-levenshtein

  • go-edlib

    📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.

  • Quickenshtein

    Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support

  • edits.cr

    Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment

  • distlib

    Distance-related functions (Damerau-Levenshtein, Jaro-Winkler, longest common substring & subsequence) implemented as a SQLite run-time loadable extension. Any UTF-8 strings are supported.

  • JavaPermutationTools

    A Java library for computation on permutations and sequences

  • edit-distance-linear

    Levenshtein edit distance in linear memory (also turns out to be faster than C++)
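
As a rough illustration of what "linear memory" means for Levenshtein distance, the sketch below keeps only two rows of the DP table instead of the full matrix. It is a generic textbook-style version written for this list, not the code of edit-distance-linear, and the function name `levenshtein` is just a placeholder.

```python
def levenshtein(a, b):
    """Levenshtein distance between a and b, keeping only two DP rows."""
    if len(a) < len(b):
        a, b = b, a  # rows are sized by b, so make b the shorter string
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to "" is i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


print(levenshtein("kitten", "sitting"))
```

Memory use is proportional to the length of the shorter string, while the running time is still proportional to the product of the two lengths.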

  • Edits

    Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-06-27.

What are some of the best open-source edit-distance projects? This list will help you:

#    Project                 Stars
1    SymSpell                2,619
2    PolyFuzz                  532
3    edlib                     387
4    js-levenshtein            344
5    go-edlib                  336
6    Quickenshtein             175
7    edits.cr                   16
8    distlib                    12
9    JavaPermutationTools        5
10   edit-distance-linear        3
11   Edits                       2