lemmatization-lists

Machine-readable lists of lemma-token pairs in 23 languages. (by michmech)

Lemmatization-lists Alternatives

Similar projects and alternatives to lemmatization-lists based on common topics and language

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better lemmatization-lists alternative or higher similarity.

lemmatization-lists reviews and mentions

Posts with mentions or reviews of lemmatization-lists. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-07.
  • Ambiguous spellings
    2 projects | /r/Redactle | 7 Feb 2023
    It's a bit of a massive undertaking maintaining such a data set so it's mostly taken from https://github.com/michmech/lemmatization-lists At the top of the file you'll see some additional I've added to deal with personal pronouns and numbers.
  • Is there a text list of words and their variations?
    1 project | /r/LanguageTechnology | 8 Jun 2021
    Another one to add to your list: https://github.com/michmech/lemmatization-lists
  • Trying to build a lemmatizer from scratch
    1 project | /r/LanguageTechnology | 23 Dec 2020
    One approach might be to take a lemmatization list, like the lemma-token lists at https://github.com/michmech/lemmatization-lists/, and compile it into a Finite State Transducer. The Helsinki FST package, for instance, has an hfst-strings2fst command to compile pairs of strings into a transducer. You might need to do some reformatting of the input first.
  • A note from our sponsor - SaaSHub
    www.saashub.com | 29 Apr 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic lemmatization-lists repo stats
3
303
0.0
about 2 years ago

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com