Python Word2vec

Open-source Python projects categorized as Word2vec

Top 13 Python Word2vec Projects

  • gensim

    Topic Modelling for Humans

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • flashtext

    Extract Keywords from sentence or Replace keywords in sentences.

    Project mention: Show HN: LLMs can generate valid JSON 100% of the time | news.ycombinator.com | 2023-08-14

    I have some other comment on this thread where I point out why I don’t think it’s superficial. Would love to get your feedback on that if you feel like spending more time on this thread.

    But it’s not obscure? FlashText was a somewhat popular paper at the time (2017) with a popular repo (https://github.com/vi3k6i5/flashtext). Their paper was pretty derivative of Aho-Corasick, which they cited. If you think they genuinely fucked up, leave an issue on their repo (I’m, maybe to your surprise lol, not the author).

    Anyway, I’m not a fan of the whatabboutery here. I don’t think OG’s paper is up to snuff on its lit review - do you?

  • scattertext

    Beautiful visualizations of how language differs among document types.

  • magnitude

    A fast, efficient universal vector embedding utility package.

  • textaugment

    TextAugment: Text Augmentation Library

  • pyRDF2Vec

    🐍 Python Implementation and Extension of RDF2Vec

  • text-summarizer

    Python Framework for Extractive Text Summarization

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • japanese-words-to-vectors

    Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.

  • dutch-word-embeddings

    Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.

  • YassQueenDB

    Graph database library that allows you to store, analyze, and search through your data in a graph format. By using the Universal Sentence Encoder, it provides an efficient and semantic approach to handle text data. 📚🧠🚀

  • Char2Vec

    Training from scratch a character embedding following Word2Vec, using tensorflow.

    Project mention: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text | news.ycombinator.com | 2023-12-03

    There are character embeddings that allow one to recover word embedding just by summing embeddings of individual bytes/chars in the word: https://github.com/sonlamho/Char2Vec

    The encodings of LM's tokens reserve individual characters so that scrambled or new words can be encoded. And most LM's are trained on scrambled words as part of training copus, thus, they learn character-level embeddings.

    Thus, basically, the paper is a very old news. This behavior is expected.

  • recommendation-system

    Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)

  • embeddings_plot

    A command line utility to create a plots of word embeddings

    Project mention: Word Embedding Visualization Tool | news.ycombinator.com | 2023-12-06
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Word2vec discussion

Log in or Post with

Python Word2vec related posts

Index

What are some of the best open-source Word2vec projects in Python? This list will help you:

Project Stars
1 gensim 15,452
2 flashtext 5,574
3 scattertext 2,225
4 magnitude 1,616
5 textaugment 387
6 pyRDF2Vec 242
7 text-summarizer 114
8 japanese-words-to-vectors 83
9 dutch-word-embeddings 43
10 YassQueenDB 14
11 Char2Vec 13
12 recommendation-system 12
13 embeddings_plot 3

Sponsored
Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com

Did you konow that Python is
the 1st most popular programming language
based on number of metions?