Jupyter Notebook Embeddings

Open-source Jupyter Notebook projects categorized as Embeddings

Top 11 Jupyter Notebook Embedding Projects

  • featureform

    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

    Project mention: Still look familiar? | /r/u_featureform | 2023-07-13
  • awesome-generative-ai

    A curated list of Generative AI tools, works, models, and references (by filipecalegario)

    Project mention: Generative AI – A curated list of Generative AI tools, works, models | news.ycombinator.com | 2023-07-14
  • Onboard AI

    Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.

  • Fast_Sentence_Embeddings

    Compute Sentence Embeddings Fast!

    Project mention: You probably shouldn't use OpenAI's embeddings | news.ycombinator.com | 2023-03-30

    You can find some comparisons and evaluation datasets/tasks here: https://www.sbert.net/docs/pretrained_models.html

    Generally MiniLM is a good baseline. For faster models you want this library:

    https://github.com/oborchers/Fast_Sentence_Embeddings

    For higher quality ones, just take the bigger/slower models in the SentenceTransformers library

  • what_are_embeddings

    A deep dive into embeddings starting from fundamentals

    Project mention: GPT Weekly - 3rd July Edition - Adobe’s Safety Net, Open-Source AI: Expanded Context Lengths and more. | /r/ChatGPTPromptGenius | 2023-07-03

    LLMs require embeddings to work. What are embeddings? But, did you know embeddings also power the recommendation engines? Another guide on embeddings.

  • cleora

    Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.

  • kgtk

    Knowledge Graph Toolkit

  • examples

    Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc. (by towhee-io)

    Project mention: FLaNK Stack Weekly for 07August2023 | dev.to | 2023-08-07
  • SonarLint

    Clean code begins in your IDE with SonarLint. Up your coding game and discover issues early. SonarLint is a free plugin that helps you find & fix bugs and security issues from the moment you start writing code. Install from your favorite IDE marketplace today.

  • Research2Vec

    Representing research papers as vectors / latent representations.

  • entity-embed

    PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.

  • embedding-encoder

    Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.

  • emotion-classifier

    An attention-based BiLSTM for emotion classification.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-08-07.

Jupyter Notebook Embeddings related posts

Index

What are some of the best open-source Embedding projects in Jupyter Notebook? This list will help you:

Project Stars
1 featureform 1,520
2 awesome-generative-ai 1,349
3 Fast_Sentence_Embeddings 594
4 what_are_embeddings 534
5 cleora 450
6 kgtk 313
7 examples 250
8 Research2Vec 192
9 entity-embed 128
10 embedding-encoder 38
11 emotion-classifier 5
Collect and Analyze Billions of Data Points in Real Time
Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
www.influxdata.com