NLP-progress VS SymSpell

Compare NLP-progress and SymSpell to see how they differ.

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks. (by sebastianruder)
              NLP-progress   SymSpell
Mentions      17             16
Stars         22,290         3,032
Growth        -              -
Activity      3.2            6.0
Last commit   2 months ago   14 days ago
Language      Python         C#
License       MIT License    MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

NLP-progress

Posts with mentions or reviews of NLP-progress. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-11-11.

SymSpell

Posts with mentions or reviews of SymSpell. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-08-30.
  • Show HN: I automated 1/2 of my typing
    11 projects | news.ycombinator.com | 30 Aug 2023
  • Learn more about spell checkers
    2 projects | /r/nlp_knowledge_sharing | 18 Mar 2023
    Books:
    a. "Speech and Language Processing" by Daniel Jurafsky and James H. Martin (3rd Edition) - This book covers various aspects of natural language processing, including a section on spelling correction that provides a comprehensive introduction to the topic.
    b. "Foundations of Statistical Natural Language Processing" by Christopher D. Manning and Hinrich Schütze - This book provides an overview of statistical approaches in NLP, including a chapter on spelling correction.
    Articles:
    a. "How to Write a Spelling Corrector" by Peter Norvig - This article demonstrates the development of a simple spelling corrector using statistical algorithms. It's a great starting point for understanding the basics of spell checkers. (Link: https://norvig.com/spell-correct.html)
    b. "The Design of a Proofreading Software Service" by Michael D. Garris and James L. Blue - This article presents the design and implementation of a spelling correction system that can be integrated into various applications. (Link: https://www.nist.gov/system/files/documents/itl/iad/89403123.pdf)
    c. "A Fast and Flexible Spellchecker" by Atkinson, K. (2006) - This article details the design of a spell checker that uses a combination of rule-based and statistical approaches for improved performance. (Link: https://aspell.net/0.60.6.1/aspell-0.60.6.1.pdf)
    Online Resources:
    a. The Natural Language Toolkit (NLTK) - This is a popular Python library for natural language processing. It includes a spell checker module and various examples of how to use it. (Link: https://www.nltk.org/)
    b. SymSpell - This is an open-source spell checking library that uses a Symmetric Delete spelling correction algorithm for high performance and accuracy. The GitHub repository includes a detailed description of the algorithm and examples of how to use it. (Link: https://github.com/wolfgarbe/SymSpell)
    These resources should provide a solid foundation for understanding the design, algorithms, and usage of spell checkers. Happy learning!
  • Turn the spellchecker into autocorrection software
    2 projects | /r/learnprogramming | 13 Feb 2023
    Can github.com/wolfgarbe/SymSpell, github.com/ruby/did_you_mean, or any of the spellcheckers at github.com/topics/spell-check?o=desc&s=forks be used as autocorrection software?
  • Hacker News top posts: Mar 6, 2022
    3 projects | /r/hackerdigest | 6 Mar 2022
    SymSpell: 1M times faster spelling correction (6 comments)
  • SymSpell: 1M times faster spelling correction
    8 projects | news.ycombinator.com | 6 Mar 2022
  • Typo correction using NLP
    4 projects | /r/LanguageTechnology | 19 Mar 2021
    SymSpell
  • Fuzzy Name Matching in Postgres
    3 projects | news.ycombinator.com | 24 Feb 2021
    I'm glad to see these built-in to Postgres, as these are the basics of fuzzy string matching.

    A quantum leap would be to integrate an implementation of the symmetric delete algorithm, such as https://github.com/wolfgarbe/SymSpell

    Soundex and Phonex can yield too many false negatives outside of phonetically English names. Levenshtein/Jaro-Winkler aren't indexable solutions themselves, so they require N^2 comparisons. SymSpell conceptually combines these two into an indexed string-distance solution. It has the usual index issue of being designed for many reads, few writes.

  • Rebuilding the spellchecker, pt.4: Introduction to suggest algorithm
    3 projects | dev.to | 22 Jan 2021
    Some of the modern approaches to spellchecking still take this road: for example, the SymSpell algorithm (claiming to be "1 million times faster") is at its core just a brilliant idea for a novel storage format for a flat word list that allows optimizing the calculation of edit distance significantly.
  • Rebuilding the spellchecker, pt.3: Lookup–compounds and solutions
    7 projects | news.ycombinator.com | 15 Jan 2021
    https://github.com/wolfgarbe/SymSpell lists 5 JS implementations (+ a Rust one that compiles to web assembly)
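
Several of the posts above describe SymSpell as an indexed string-distance lookup built on the symmetric delete idea. The sketch below illustrates that idea in Python. It is not the SymSpell library's code or API: the function names (deletes, build_index, lookup) are hypothetical, it uses plain Levenshtein distance for verification, and it leaves out word frequencies and the other optimizations a production implementation would need.

```python
def deletes(word, max_distance):
    """Return the word plus every string reachable by deleting
    up to max_distance characters from it."""
    results = {word}
    frontier = {word}
    for _ in range(max_distance):
        frontier = {w[:i] + w[i + 1:] for w in frontier for i in range(len(w))}
        results |= frontier
    return results


def build_index(words, max_distance=2):
    """Map every delete variant of every dictionary word back to the
    words that produce it. This is the write-heavy, read-cheap index."""
    index = {}
    for word in words:
        for variant in deletes(word, max_distance):
            index.setdefault(variant, set()).add(word)
    return index


def edit_distance(a, b):
    """Plain Levenshtein distance via dynamic programming
    (an assumption of this sketch; other distance measures are possible)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete from a
                           cur[j - 1] + 1,             # insert into a
                           prev[j - 1] + (ca != cb)))  # substitute
        prev = cur
    return prev[-1]


def lookup(term, index, max_distance=2):
    """Generate deletes of the query only, collect candidate words from the
    index, then verify each candidate with a real distance computation."""
    candidates = set()
    for variant in deletes(term, max_distance):
        candidates |= index.get(variant, set())
    scored = ((edit_distance(term, c), c) for c in candidates)
    return sorted((d, c) for d, c in scored if d <= max_distance)


index = build_index(["hello", "help", "world"])
print(lookup("helo", index))  # [(1, 'hello'), (1, 'help')]
```

The query is compared only against words that share a delete variant with it, not against the whole dictionary, which is the sense in which the approach is indexed. The cost is a larger index that has to be rebuilt or extended whenever the word list changes, matching the "many reads, few writes" remark above.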

What are some alternatives?

When comparing NLP-progress and SymSpell you can also consider the following projects:

JamSpell - Modern spell checking library - accurate, fast, multi-language

hunspell - The most popular spellchecking library.

wtpsplit - Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

nlp_tasks - Natural Language Processing Tasks and References

languagetool - Style and Grammar Checker for 25+ Languages

SymSpell - A JavaScript implementation of the Symmetric Delete spelling correction algorithm.

ruby-spellchecker - Fast English spelling and grammar checker that can be used for autocorrection.

awesome-hungarian-nlp - A curated list of NLP resources for Hungarian

nlprule - A fast, low-resource Natural Language Processing and Text Correction library written in Rust.

OPUS-MT-train - Training open neural machine translation models

tldr-transformers - The "tl;dr" on a few notable transformer papers (pre-2022).