allennlp vs inltk

allennlp

An open-source NLP research library, built on PyTorch. (by allenai)

DISCONTINUED

inltk

Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need (by goru001)

NLP Deep Learning indic-languages Pytorch data-augmentation sentence-similarity sentence-encoding word-embeddings sentence-embeddings

Source Code

inltk.readthedocs.io

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

allennlp		inltk
	Project
13	Mentions	1
11,337	Stars	811
-	Growth	-
8.4	Activity	0.0
over 1 year ago	Latest Commit	3 months ago
Python	Language	Python
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

allennlp

Posts with mentions or reviews of allennlp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-07-11.

How to solve ConfigurationError using HuggingFace Token Classifier
1 project | /r/learnpython | 8 Oct 2022

No clue. So what I did was google the error. Here's what I found: https://github.com/allenai/allennlp/issues/4319
AllenNLP will be unmaintained in December
1 project | /r/hypeurls | 11 Jul 2022

6 projects | news.ycombinator.com | 11 Jul 2022
AllenNLP Is EOL
1 project | news.ycombinator.com | 10 Jul 2022
Any recommendation for the replacement of the toolkit jiant? [Research] [Discussion]
3 projects | /r/MachineLearning | 11 Jun 2022
Cedille, the largest French language model, open source with a freely accessible playground
3 projects | /r/GPT3 | 12 Nov 2021
[P] Cedille, the largest French language model (6b), released in open source
5 projects | /r/MachineLearning | 10 Nov 2021

Another aspect we had fun with is dataset filtering. We have run the whole C4 French dataset through the Detoxify classifier to clean it up 🤬
Any allennlp users in this sub?
1 project | /r/LanguageTechnology | 8 Oct 2021

https://github.com/allenai/allennlp/discussions looks active
Multilingual C4 (mC4) Dataset now released
1 project | /r/Multimodal | 17 Jun 2021
C4 dataset released (800GB Common Crawl-derived text; T5 training data)
1 project | /r/mlscaling | 16 Mar 2021

inltk

Posts with mentions or reviews of inltk. We have used some of these posts to build our list of alternatives and similar projects.

Which are top APIs for Indian languages mainly VR, OCR, Speech - Text - Speech?
1 project | /r/LanguageTechnology | 29 Jan 2021

The best tool will vary a little bit from language to language, but your best bets are probably the Indic NLP Library and iNLTK

What are some alternatives?

When comparing allennlp and inltk you can also consider the following projects:

cedille-ai - ✒️ Cedille is a large French language model (6B), released under an open-source license

DiffCSE - Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

fairseq - Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

SimCSE - [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

mesh-transformer-jax - Model parallel transformers in JAX and Haiku

smaller-labse - Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE

lm-evaluation-harness - A framework for few-shot evaluation of language models.

clip-as-service - 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

python-sutime - Python wrapper for Stanford CoreNLP's SUTime

KitanaQA - KitanaQA: Adversarial training and data augmentation for neural question-answering models

PaddleHub - Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

ModelNet40-C - Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296