Top 6 Python word-embedding Projects
Topic Modelling for Humans
Project mention: Gensim: Topic Modelling for Humans | news.ycombinator.com | 2021-12-07
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Project mention: How to create a dataset for training NER models when you only have entity data | reddit.com/r/LanguageTechnology | 2021-10-18
We have a list of entities in text files, one entity per line. We intend to train the flair model to detect these entities in text, but NER models require each entity to be labeled in context, within a paragraph, using the BIO format.
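One way to approach the problem described in this mention is to project the entity list onto raw sentences and emit BIO tags. The following is a minimal sketch under stated assumptions, not flair's own API: the function name `bio_tag`, the whitespace tokenization, the case-insensitive exact matching, and the `LOC` label are all hypothetical choices made for illustration.

```python
def bio_tag(tokens, entities, label="ENT"):
    """Tag tokens with BIO labels by exact-matching known entity phrases.

    Hypothetical helper: assumes whitespace-tokenized input and
    case-insensitive matching against a flat list of entity strings.
    """
    # Split each entity into tokens; try longer phrases first so that
    # "New York City" wins over "New York" when both could match.
    phrases = sorted((e.lower().split() for e in entities), key=len, reverse=True)
    lowered = [t.lower() for t in tokens]
    tags = ["O"] * len(tokens)

    i = 0
    while i < len(tokens):
        for phrase in phrases:
            n = len(phrase)
            if n and lowered[i:i + n] == phrase:
                tags[i] = f"B-{label}"          # first token of the entity
                for j in range(i + 1, i + n):
                    tags[j] = f"I-{label}"      # continuation tokens
                i += n
                break
        else:
            i += 1  # no entity starts at this position

    return list(zip(tokens, tags))


sentence = "She moved to New York City last year".split()
tagged = bio_tag(sentence, ["New York City", "New York"], label="LOC")

# One "token tag" pair per line is a common column layout for NER training files.
for token, tag in tagged:
    print(token, tag)
```

Exact matching like this will miss inflected or misspelled mentions and can mislabel ambiguous words, so the output is best treated as a noisy starting point to review by hand.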
Beautiful visualizations of how language differs among document types.
Project mention: Clustering of text - Where to start? | reddit.com/r/LanguageTechnology | 2021-08-04
If what you want is to determine how similar two categories are, or to learn something about the structure or words that compose those categories, you might consider word shift graphs or Scattertext.
A fast, efficient universal vector embedding utility package.
Project mention: Text Classification Library for a Quick Baseline | news.ycombinator.com | 2021-06-23
(3) FastText now supports multiple languages.
Top2Vec learns jointly embedded topic, document and word vectors.
Project mention: Extracting topics from 250k facebook posts | reddit.com/r/LanguageTechnology | 2021-05-26
Since you already have the Facebook posts, you can use Top2Vec: https://github.com/ddangelov/Top2Vec
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.
Project mention: Which are top APIs for Indian languages mainly VR, OCR, Speech - Text - Speech? | reddit.com/r/LanguageTechnology | 2021-01-29
The best tool will vary a little from language to language, but your best bets are probably the Indic NLP Library and iNLTK.
Python word-embeddings related posts
Clustering of text - Where to start?
1 project | reddit.com/r/LanguageTechnology | 4 Aug 2021
Extracting topics from 250k facebook posts
1 project | reddit.com/r/LanguageTechnology | 26 May 2021
[Data] Top words from the last (roughly) 200 posts on the sub
4 projects | reddit.com/r/italy | 27 Apr 2021
SOTA for Topic Modeling
2 projects | reddit.com/r/LanguageTechnology | 25 Mar 2021
[P] Information Retrieval and Event Prediction from Unstructured Document Corpus
1 project | reddit.com/r/MachineLearning | 18 Feb 2021
Clustering text embeddings: TF-IDF + BERT Sentence Embeddings [P]
2 projects | reddit.com/r/MachineLearning | 8 Feb 2021
Sunday Daily Thread: What's everyone working on this week?
3 projects | reddit.com/r/Python | 6 Feb 2021
What are some of the best open-source word-embedding projects in Python? This list will help you: