A Spellchecker Used to Be a Major Feat of Software Engineering

KeenSpell

1 1 10.0 Java

Discontinued. Java 8+ zero-dependency port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm

In some ways, I think computational linguistics (for English) has missed the mark. We have dictionaries, lexicons, grammar engines, spell checkers, pluralization rules, and quotation disambiguation. You'd think we could roll all of these into a unified, standard definition format.
My KeenWrite editor, for instance, uses:
* https://github.com/DaveJarvis/KeenSpell (spell check, lexicon)
* https://github.com/DaveJarvis/KeenQuotes (curls straight quotes)
* https://github.com/DaveJarvis/KeenWrite/blob/main/R/pluraliz... (pluralization)
I was looking at integrating LanguageTool[0] for grammar and realized that it overlaps with all three: it has part of KeenQuotes' functionality (lexing and tokenization), it duplicates the SymSpell algorithm used by KeenSpell, and, because it offers grammar corrections, it can likely pluralize words as well.
Unifying those for English alone would be a massive undertaking.
[0]: https://github.com/languagetool-org/languagetool
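For readers unfamiliar with the Symmetric Delete idea behind SymSpell, here is a minimal sketch in Python. It is not KeenSpell's or SymSpell's actual code: the function names, the tiny word list, and the default `max_distance` are all assumptions made for illustration. The index stores every dictionary word under each of its delete variants, and a lookup generates the same delete variants for the query and collects whatever they hit. A full implementation would also verify each candidate with a real edit-distance check and rank by word frequency, both of which are omitted here.

```python
from itertools import combinations


def deletes(word, max_distance):
    """Return the word plus every string made by deleting up to max_distance characters."""
    variants = {word}
    for k in range(1, min(max_distance, len(word)) + 1):
        # Choose which character positions to keep; the rest are deleted.
        for keep in combinations(range(len(word)), len(word) - k):
            variants.add("".join(word[i] for i in keep))
    return variants


def build_index(words, max_distance=1):
    """Map each delete variant to the dictionary words that produce it."""
    index = {}
    for w in words:
        for variant in deletes(w, max_distance):
            index.setdefault(variant, set()).add(w)
    return index


def candidates(term, index, max_distance=1):
    """Collect dictionary words that share a delete variant with the term.

    Sharing a variant is only a filter: some results can be farther away
    than max_distance, so a real spell checker would verify them.
    """
    found = set()
    for variant in deletes(term, max_distance):
        found |= index.get(variant, set())
    return found


# Hypothetical four-word dictionary, for illustration only.
index = build_index(["spell", "spelt", "smell", "shell"])
print(sorted(candidates("spel", index)))   # ['spell', 'spelt']
print(sorted(candidates("smell", index)))  # ['shell', 'smell', 'spell']
```

The second lookup shows why deletes alone are enough: "smell" and "spell" differ by a substitution, but both reduce to "sell" after one deletion, so they meet in the index without ever generating inserts or replacements.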

KeenQuotes

3 4 0.0 Java

Discontinued. Convert straight quotes to curly quotes
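To give a feel for what curling quotes involves, here is a deliberately naive Python sketch. It is not how KeenQuotes works; the function name and the rules are assumptions chosen for brevity. It treats a quote mark as opening when it sits at the start of the text or after whitespace, an opening bracket, or another opening quote, and as closing otherwise. A single quote between two word characters is treated as an apostrophe.

```python
import re

# Characters after which a quote mark is assumed to be an opening quote.
_BEFORE_OPENING = r"\s(\[{\u201c\u2018"


def curl_quotes(text):
    """Replace straight quotes with curly ones using simple positional rules."""
    # Apostrophes inside words: don't -> don’t
    text = re.sub(r"(?<=\w)'(?=\w)", "\u2019", text)
    # Opening double quotes: at the start, or after whitespace/bracket/opening quote.
    text = re.sub(rf'(?<![^{_BEFORE_OPENING}])"', "\u201c", text)
    # Any double quote left over is treated as closing.
    text = text.replace('"', "\u201d")
    # Same two steps for single quotes.
    text = re.sub(rf"(?<![^{_BEFORE_OPENING}])'", "\u2018", text)
    text = text.replace("'", "\u2019")
    return text


print(curl_quotes("\"Don't,\" she said, 'now.'"))
```

The sketch gets the easy cases right but mis-handles a leading apostrophe such as the one in 'twas, which it curls as an opening quote. Cases like that are where real quotation disambiguation is needed.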

KeenWrite

98 621 0.0 Java

Discontinued. Free, open-source, cross-platform desktop Markdown text editor with live preview, string interpolation, and math.

languagetool

310 11,543 10.0 Java

Style and Grammar Checker for 25+ Languages

shitty-autoreply

1 0 10.0 Python

Irritate people by getting their names wrong

Reminds me of a dumb project I did that takes a person's name and returns a name that is close, but not quite right: https://github.com/rzimmerman/shitty-autoreply
The idea was to set up an auto-reply to my email that looked automated, but surely couldn't be because it messed up the sender's name. Not nearly as clever as Peter Norvig's program and even less useful.
I'd say the bias toward common American names is a bug, but given that the goal of the project is to be obnoxious it's probably a feature.
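The "close, but not quite right" idea can be sketched with Python's standard `difflib` module. This is not the code from the linked repository: the function name, the cutoff, and the sample name list are assumptions for illustration only.

```python
import difflib


def near_miss(name, known_names, cutoff=0.6):
    """Return the most similar known name that is not the name itself.

    Falls back to the original name when nothing is similar enough.
    """
    # Exclude the correct name so the result is always "wrong".
    others = [n for n in known_names if n.lower() != name.lower()]
    matches = difflib.get_close_matches(name, others, n=1, cutoff=cutoff)
    return matches[0] if matches else name


# Hypothetical name list; a real one would be much longer.
names = ["Robert", "Roberta", "Rupert", "Alice"]
print(near_miss("Robert", names))  # Roberta
```

Any bias in the result comes straight from the name list: a sender whose name has no near neighbor in the list is simply returned unchanged.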

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Ask HN: Grammarly Alternatives?
2 projects | news.ycombinator.com | 27 Feb 2024
Recent ECE Masters grad looking to change careers from IT to RF engineering
1 project | /r/EngineeringResumes | 29 Sep 2023
Hey guys! I have my first draft here as a first-year computer engineering student. I'm preparing for an internship fair and I'd like to have something decent. Roast me!!
1 project | /r/EngineeringResumes | 25 Sep 2023
Top 3 Free Grammar Checkers for Flawless Writing
1 project | /r/math_homework_answer | 25 Sep 2023
Is there any app similar to Grammarly or Writeandimprove, but for Russian?
1 project | /r/russian | 26 Jun 2023

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Tags: Grammar, Natural Language, Natural Language Processing, style-checker, proofreading
Post date: 28 Feb 2023
