Automatically exported from code.google.com/p/word2vec (by tmikolov)

Word2vec Alternatives

Similar projects and alternatives to word2vec

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better word2vec alternative or higher similarity.

word2vec reviews and mentions

Posts with mentions or reviews of word2vec. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-12.
  • Is Cosine-Similarity of Embeddings Really About Similarity?
    2 projects | news.ycombinator.com | 12 Mar 2024
    The original paper included source, and that has their test data and results -- it gets ~77% accuracy on about 20k example word analogies (with 99.7% coverage), and 78% accuracy with phrases with 77% coverage. You can see the test set here:


  • Introduction to K-Means Clustering
    5 projects | news.ycombinator.com | 14 Mar 2022
    It is not necessarily the case.

    For example, word2vec runs k-means clustering with a cosine similarity measure [1]. It works very, very well. The caveat is that few of the optimized variants of k-means work with that "distance".

    [1] https://github.com/tmikolov/word2vec/blob/master/word2vec.c#...
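
The k-means comment above can be illustrated with a minimal sketch of clustering by cosine similarity (often called spherical k-means). This is an illustration under stated assumptions, not a transcription of the code referenced at [1]: the function names `normalize`, `dot`, and `cosine_kmeans`, the `iterations` and `seed` parameters, and the random initialization are all hypothetical choices made for this example. The idea is that once every vector and every centroid is scaled to unit length, the dot product equals the cosine similarity, so the assignment step picks the centroid with the largest dot product rather than the smallest Euclidean distance.

```python
import math
import random


def normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cosine_kmeans(vectors, k, iterations=10, seed=0):
    """Cluster vectors by cosine similarity (spherical k-means sketch).

    Hypothetical helper for illustration; not part of word2vec.
    Returns (assignments, centroids).
    """
    rng = random.Random(seed)
    points = [normalize(v) for v in vectors]
    # Assumption: initialize centroids from k randomly chosen points.
    centroids = [list(p) for p in rng.sample(points, k)]
    assignments = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: on unit vectors, the largest dot product
        # is the largest cosine similarity.
        assignments = [
            max(range(k), key=lambda c: dot(p, centroids[c])) for p in points
        ]
        # Update step: average the members, then re-normalize so the
        # centroid stays on the unit sphere. Empty clusters keep their
        # previous centroid.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                mean = [sum(col) / len(members) for col in zip(*members)]
                centroids[c] = normalize(mean)
    return assignments, centroids


if __name__ == "__main__":
    # Two directions with very different magnitudes within each group.
    vectors = [[1.0, 0.0], [2.0, 0.1], [0.0, 1.0], [0.1, 3.0]]
    assignments, centroids = cosine_kmeans(vectors, k=2)
    print(assignments)
```

Because only direction matters here, `[0.0, 1.0]` and `[0.1, 3.0]` land in the same cluster even though they are far apart in Euclidean terms. The re-normalization in the update step is also one reason the caveat in the comment holds: speed-ups that rely on properties of Euclidean distance do not carry over automatically.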


Basic word2vec repo stats

tmikolov/word2vec is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of word2vec is C.
