Python Embeddings

Open-source Python projects categorized as Embeddings | Edit details

Top 16 Python Embedding Projects

  • hub

    A library for transfer learning by reusing parts of TensorFlow models. (by tensorflow)

    Project mention: Tensorflow Custom TFLite java.lang.NullPointerException: Cannot allocate memory for the interpreter | | 2022-05-14

    I have created a custom tensorflow lite model using from using the following command

  • PyTorch-NLP

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    Project mention: Introduction to PyTorch | | 2022-05-02


  • Scout APM

    Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.

  • lightly

    A python library for self-supervised learning on images.

    Project mention: Self-Supervised Models are More Robust and Fair | | 2022-04-07

    If you’re interested in self-supervised learning and want to try it out yourself you can check out our open-source repository for self-supervised learning.

  • magnitude

    A fast, efficient universal vector embedding utility package.

    Project mention: Text Classification Library for a Quick Baseline | | 2021-06-23

    (3) FastText now supports multiple languages [2].


  • eda_nlp

    Data augmentation for NLP, presented at EMNLP 2019

    Project mention: Getting Identical Results with Different Models | | 2022-03-17

    Code for found:

  • contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

    Project mention: Using Transformer for Topic Modeling - what are the options? | | 2022-02-15

    This library from MILA seems quite neat! I haven’t had the change to play with it though :

  • PolyFuzz

    Fuzzy string matching, grouping, and evaluation.

  • SonarQube

    Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.

  • towhee

    A framework that provides a simple API for developing ML-driven data processing and search pipelines.

    Project mention: A quick tip on DataFrame.apply | | 2022-05-16

    The project's homepage is, and you can find more about towhee by going through the documents.

  • CX_DB8

    a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)

    Project mention: Haystack 1.0 – open-source NLP framework to build NLProc back end applications | | 2021-12-09

    Is there any path forward to make Haystack do word-level extractive summarization? e.g. like this:

    or like this:

    I am trying to find anything better than these two for this task. I feel like Haystack could be an option - but I am not sure.

  • laserembeddings

    LASER multilingual sentence embeddings as a pip package

  • pyRDF2Vec

    🐍 Python Implementation and Extension of RDF2Vec

  • imgbeddings

    Python package to generate image embeddings with CLIP without PyTorch/TensorFlow

    Project mention: GitHub - minimaxir/imgbeddings: Python package to generate image embeddings with CLIP without PyTorch/TensorFlow | | 2022-04-02
  • multimodal

    A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal" (by cdancette)

  • wembedder

    Wikidata embedding

    Project mention: [D] Graph embeddings of Wikidata items | | 2021-05-28

    I have made Wembedder that is using a simple RDF2Vec model, that you might try. You can download it from The current pre-trained model running at is pretty small with only around 600.000 Wikidata items to fit the size of the Toolforge cloud service. It means that the Python programming language is in the model, but not the snake nor Django :/.

  • embedded-topic-model

    A package to run embedded topic modelling with ETM. Adapted from the original at:

    Project mention: BERTopic is the future of topic modeling in NLP | | 2022-05-11
  • Sentimentanalysis

    Language independent sentiment analysis

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-05-16.

Python Embeddings related posts


What are some of the best open-source Embedding projects in Python? This list will help you:

Project Stars
1 hub 3,103
2 PyTorch-NLP 2,058
3 lightly 1,590
4 magnitude 1,506
5 eda_nlp 1,247
6 contextualized-topic-models 860
7 PolyFuzz 463
8 towhee 443
9 CX_DB8 187
10 laserembeddings 185
11 pyRDF2Vec 150
12 imgbeddings 60
13 multimodal 52
14 wembedder 42
15 embedded-topic-model 23
16 Sentimentanalysis 8
Find remote jobs at our new job board There are 13 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives