embedding

Top 22 embedding Open-Source Projects

  • PaddleNLP

    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis, etc.

  • myGPTReader

    A community-driven way to read and chat with AI bots, powered by ChatGPT.

  • awesome-community-detection

    A curated list of community detection research papers with implementations.

  • infinity

    The AI-native database built for LLM applications, providing incredibly fast full-text and vector search (by infiniflow)

  • Project mention: Speeding up C++ build times | news.ycombinator.com | 2024-04-28

    Using C++20 modules, take a look at this project: https://github.com/infiniflow/infinity

  • modelfusion

    The TypeScript library for building AI applications.

  • Project mention: Next.js and GPT-4: A Guide to Streaming Generated Content as UI Components | dev.to | 2024-01-25

    ModelFusion is an AI integration library that I am developing. It enables you to integrate AI models into your JavaScript and TypeScript applications. You can install it with the following command:

  • MyScaleDB

    An open-source, high-performance SQL vector database built on ClickHouse.

  • Project mention: Myscaledb: Open-source SQL vector database to build AI apps using SQL | news.ycombinator.com | 2024-04-01
  • Angular-Penalty-Softmax-Losses-Pytorch

    Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)

  • ChatGptNet

    A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service

  • word2vec

    Go library for performing computations in word2vec binary models (by sajari)

  • pyTsetlinMachine

    Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget

  • amazon-denseclus

    Clustering for mixed-type data

  • hn-recommendation-api

    A recommendation system for Hacker News. Get the most similar posts for a given URL

  • Project mention: Show HN: HN recommend – A recommendation engine for Hacker News | news.ycombinator.com | 2023-06-19
  • json-repair

    🔧 Repair JSON! Solution for JSON anomalies from LLMs.

  • Project mention: Repair JSON! Solution for JSON Anomalies from LLMs | news.ycombinator.com | 2024-04-21

    https://github.com/RealAlexandreAI/json-repair

    Discover the power of seamless JSON string repair with the json-repair library by RealAlexandreAI, available on GitHub. This ingenious tool is designed to effortlessly mend corrupted JSON strings generated by Large Language Models (LLMs), ensuring your data integrity remains uncompromised.

    Key Features:

    - Zero Dependencies: The json-repair library prides itself on having zero external dependencies, making it incredibly lightweight and easy to integrate into your projects.

    - Universal Compatibility: It is fully compatible with all versions of Go, ensuring that developers across the board can benefit from its capabilities without worrying about version conflicts.

    - Workflow Efficiency: Perfectly suited for incorporation into your workflow, json-repair guarantees non-disruptive operation. It smoothly handles all input and output, allowing your processes to continue without any hiccups.

  • AnnA_Anki_neuronal_Appendix

    Using machine learning on your Anki collection to enhance scheduling via semantic clustering and semantic similarity

  • augmented-interpretable-models

    Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.

  • GLOM-TensorFlow

    An attempt at implementing GLOM, the architecture from Geoffrey Hinton's paper on emergent part-whole hierarchies from data

  • exembed

    Go Embed experiments

  • exn

    Write and render rich, scriptable, and interactive notes

  • dawnsearch

    The open source distributed web search engine that searches by meaning.

  • Project mention: DawnSearch - Open source distributed web search experiment | /r/opensource | 2023-08-08

    Hi everyone! Over the past weeks I've been working on DawnSearch, an open source distributed web search engine, using semantic (vector) search. It is written in Rust and uses some amazing libraries, like rust-bert and USearch.

  • autoembedder

    PyTorch autoencoder with additional embeddings layer for categorical data 🚘

  • Project mention: How to learn Categorical Embeddings in Unsupervised Learning? | /r/deeplearning | 2023-06-25

    Solutions I found here and here propose saving the input batch in a variable after feeding it into the embeddings layer (but before the AE) and using that as the target for the loss function.

  • SHREC2023-ANIMAR

    Source codes of team TikTorch (1st place solution) for track 2 and 3 of the SHREC2023 Challenge

  • llm-linker

    🔗 LLM links to everything!

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

embedding-related posts

  • Myscaledb: Open-source SQL vector database to build AI apps using SQL

    1 project | news.ycombinator.com | 1 Apr 2024
  • Show HN: High-Performance SQL Vector Database MyScaleDB Goes Open Source

    1 project | news.ycombinator.com | 1 Apr 2024
  • Show HN: My recommendation engine for Hacker News

    1 project | /r/patient_hackernews | 19 Jun 2023
  • Show HN: My recommendation engine for Hacker News

    1 project | /r/hackernews | 19 Jun 2023
  • Show HN: HN recommend – A recommendation engine for Hacker News

    1 project | /r/hypeurls | 19 Jun 2023
  • Show HN: HN recommend – A recommendation engine for Hacker News

    6 projects | news.ycombinator.com | 19 Jun 2023
  • GitHub - madawei2699/myGPTReader: myGPTReader is a bot on Slack that can read and summarize any webpage, documents including ebooks, or even videos from YouTube. It can communicate with you through voice. (a Python project)

    1 project | /r/Python | 30 Mar 2023

Index

What are some of the best open-source embedding projects? This list will help you:

 #  Project                                  Stars
 1  PaddleNLP                               11,515
 2  myGPTReader                              4,399
 3  awesome-community-detection              2,274
 4  infinity                                 1,793
 5  modelfusion                                952
 6  MyScaleDB                                  665
 7  Angular-Penalty-Softmax-Losses-Pytorch     476
 8  ChatGptNet                                 283
 9  word2vec                                   187
10  pyTsetlinMachine                           122
11  amazon-denseclus                            90
12  hn-recommendation-api                       79
13  json-repair                                 65
14  AnnA_Anki_neuronal_Appendix                 57
15  augmented-interpretable-models              37
16  GLOM-TensorFlow                             36
17  exembed                                     30
18  exn                                         11
19  dawnsearch                                  10
20  autoembedder                                 8
21  SHREC2023-ANIMAR                             6
22  llm-linker                                   2
