Python word-embeddings

Open-source Python projects categorized as word-embeddings | Edit details

Top 6 Python word-embedding Projects

  • gensim

    Topic Modelling for Humans

    Project mention: Topic modelling with Gensim and SpaCy on startup news | dev.to | 2022-01-17

    For the topic modelling itself, I am going to use Gensim library by Radim Rehurek, which is very developer friendly and easy to use.

  • flair

    A very simple framework for state-of-the-art Natural Language Processing (NLP)

    Project mention: The Spacy NER model for Spanish is terrible | reddit.com/r/LanguageTechnology | 2021-12-20

    Had the same experience with the german model in spacy (but tbh, the quailty of my textdata was bad). A bert based approach with flair really improved my results. I think there is a spanish pretrained model also available

  • Scout APM

    Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.

  • scattertext

    Beautiful visualizations of how language differs among document types.

    Project mention: Clustering of text - Where to start? | reddit.com/r/LanguageTechnology | 2021-08-04

    If what you want is to determine how similar two categories are, or to learn something about the structure or words that compose those categories, you might consider word shift graphs or Scattertext.

  • Top2Vec

    Top2Vec learns jointly embedded topic, document and word vectors.

    Project mention: Extracting topics from 250k facebook posts | reddit.com/r/LanguageTechnology | 2021-05-26

    Since you already have the facebook posts, you can use top2vec https://github.com/ddangelov/Top2Vec

  • magnitude

    A fast, efficient universal vector embedding utility package.

    Project mention: Text Classification Library for a Quick Baseline | news.ycombinator.com | 2021-06-23

    (3) FastText now supports multiple languages [2].

    [1] https://github.com/plasticityai/magnitude#pre-converted-magn...

  • inltk

    Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need

    Project mention: Which are top APIs for Indian languages mainly VR, OCR, Speech - Text - Speech? | reddit.com/r/LanguageTechnology | 2021-01-29

    The best tool will vary a little bit from language to language, but your best bets are probably the Indic NLP Library and iNLTK

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-01-17.

Python word-embeddings related posts

Index

What are some of the best open-source word-embedding projects in Python? This list will help you:

Project Stars
1 gensim 12,861
2 flair 11,159
3 scattertext 1,744
4 Top2Vec 1,510
5 magnitude 1,500
6 inltk 746
Find remote jobs at our new job board 99remotejobs.com. There are 29 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
Static code analysis for 29 languages.
Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.
www.sonarqube.org