tldr-transformers vs lemmatization-lists

tldr-transformers

The "tl;dr" on a few notable transformer papers (pre-2022). (by will-thompson-k)

DISCONTINUED

Suggest alternative

Edit details

lemmatization-lists

Machine-readable lists of lemma-token pairs in 23 languages. (by michmech)

NLP lemmatization

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

tldr-transformers		lemmatization-lists
	Project
4	Mentions	3
167	Stars	303
-	Growth	-
0.0	Activity	0.0
over 1 year ago	Latest Commit	about 2 years ago
	Language
MIT License	License	ODC Open Database License v1.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

tldr-transformers

Posts with mentions or reviews of tldr-transformers. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-08-12.

Show HN: The “tl;dr” of Recent Transformer Papers
1 project | news.ycombinator.com | 15 Aug 2021
Show HN: Tl;Dr” on Transformers Papers
1 project | news.ycombinator.com | 12 Aug 2021

With the explosion in research on all things transformers, it seemed there was a need to have a single table to distill the "tl;dr" of each paper's contributions relative to each other. Here is what I got so far: https://github.com/will-thompson-k/tldr-transformers . Would love feedback - and feel free to contribute too :)
[P] NLP "tl;dr" Notes on Transformers
2 projects | /r/MachineLearning | 12 Aug 2021

In any case, I'm liking the first glance so far. I'd just transpose the summary tables so they wouldn't get so tightly squeezed: https://github.com/will-thompson-k/tldr-transformers/blob/main/notes/bart.md

1 project | /r/learnmachinelearning | 12 Aug 2021

With the explosion in work on all things transformers, I felt the need to keep a single table of the "tl;dr" of various papers to distill their main takeaways: https://github.com/will-thompson-k/tldr-transformers . Would love feedback!

lemmatization-lists

Posts with mentions or reviews of lemmatization-lists. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-07.

Ambiguous spellings
2 projects | /r/Redactle | 7 Feb 2023

It's a bit of a massive undertaking maintaining such a data set so it's mostly taken from https://github.com/michmech/lemmatization-lists At the top of the file you'll see some additional I've added to deal with personal pronouns and numbers.
Is there a text list of words and their variations?
1 project | /r/LanguageTechnology | 8 Jun 2021

Another one to add to your list: https://github.com/michmech/lemmatization-lists
Trying to build a lemmatizer from scratch
1 project | /r/LanguageTechnology | 23 Dec 2020

One approach might be to take a lemmatization list, like the lemma-token lists at https://github.com/michmech/lemmatization-lists/, and compile it into a Finite State Transducer. The Helsinki FST package, for instance, has an hfst-strings2fst command to compile pairs of strings into a transducer. You might need to do some reformatting of the input first.

What are some alternatives?

When comparing tldr-transformers and lemmatization-lists you can also consider the following projects:

NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

trankit - Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

FARM - :house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

awesome-sentiment-analysis - Repository with all what is necessary for sentiment analysis and related areas

azure-sql-db-openai - Samples on how to use Azure SQL database with Azure OpenAI

thesaurus - Offline database of synonyms/thesaurus

long-range-arena - Long Range Arena for Benchmarking Efficient Transformers

Awesome-pytorch-list - A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

transformers-convert

language-planner - Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

awesome-instruction-dataset - A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

tldr-transformers vs NLP-progress lemmatization-lists vs trankit tldr-transformers vs FARM lemmatization-lists vs awesome-sentiment-analysis tldr-transformers vs azure-sql-db-openai lemmatization-lists vs thesaurus tldr-transformers vs long-range-arena lemmatization-lists vs Awesome-pytorch-list tldr-transformers vs transformers-convert tldr-transformers vs language-planner tldr-transformers vs transformers tldr-transformers vs awesome-instruction-dataset

Compare tldr-transformers vs lemmatization-lists and see what are their differences.

tldr-transformers

lemmatization-lists

tldr-transformers

lemmatization-lists

What are some alternatives?