SaaSHub helps you find the best software and product alternatives Learn more →
Top 22 embedding Open-Source Projects
-
PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
awesome-community-detection
A curated list of community detection research papers with implementations.
-
infinity
The AI-native database built for LLM applications, providing incredibly fast full-text and vector search (by infiniflow)
-
Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
pyTsetlinMachine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
-
hn-recommendation-api
A recommendation system for Hacker News. Get the most similar posts for a given URL
-
AnnA_Anki_neuronal_Appendix
Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
-
augmented-interpretable-models
Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.
-
GLOM-TensorFlow
An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data
-
SHREC2023-ANIMAR
Source codes of team TikTorch (1st place solution) for track 2 and 3 of the SHREC2023 Challenge
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Using C++20 modules, take a look at this project: https://github.com/infiniflow/infinity
Project mention: Next.js and GPT-4: A Guide to Streaming Generated Content as UI Components | dev.to | 2024-01-25ModelFusion is an AI integration library that I am developing. It enables you to integrate AI models into your JavaScript and TypeScript applications. You can install it with the following command:
Project mention: Myscaledb: Open-source SQL vector database to build AI apps using SQL | news.ycombinator.com | 2024-04-01
Project mention: Show HN: HN recommend – A recommendation engine for Hacker News | news.ycombinator.com | 2023-06-19
Project mention: Repair JSON!Solution for JSON Anomalies from LLMs | news.ycombinator.com | 2024-04-21https://github.com/RealAlexandreAI/json-repair
Discover the power of seamless JSON string repair with the json-repair library by RealAlexandreAI, available on GitHub. This ingenious tool is designed to effortlessly mend corrupted JSON strings generated by Large Language Models (LLMs), ensuring your data integrity remains uncompromised.
Key Features:
- Zero Dependencies: The json-repair library prides itself on having zero external dependencies, making it incredibly lightweight and easy to integrate into your projects.
- Universal Compatibility: It is fully compatible with all versions of Go, ensuring that developers across the board can benefit from its capabilities without worrying about version conflicts.
- Workflow Efficiency: Perfectly suited for incorporation into your workflow, json-repair guarantees non-disruptive operation. It smoothly handles all input and output, allowing your processes to continue without any hiccups.
Project mention: DawnSearch - Open source distributed web search experiment | /r/opensource | 2023-08-08Hi everyone! Over the past weeks I've been working on DawnSearch, an open source distributed web search engine, using semantic (vector) search. It is written in Rust and uses some amazing libraries, like rust-bert and USearch.
Project mention: How to learn Categorial Embeddings in Unsupervised Learning? | /r/deeplearning | 2023-06-25Solutions I found here and here propose to save the Input Batch as a in a variable after feeding it into the Embeddings Layer (but before the AE) and use that as the target for the loss function.
embedding related posts
-
Myscaledb: Open-source SQL vector database to build AI apps using SQL
-
Show HN: High-Performance SQL Vector Database MyScaleDB Goes Open Source
-
Show HN: My recommendation engine for Hacker News
-
Show HN: My recommendation engine for Hacker News
-
Show HN: HN recommend A recommendation engine for Hacker News
-
Show HN: HN recommend – A recommendation engine for Hacker News
-
GitHub - madawei2699/myGPTReader: myGPTReader is a bot on Slack that can read and summarize any webpage, documents including ebooks, or even videos from YouTube. It can communicate with you through voice. (a Python project)
-
A note from our sponsor - SaaSHub
www.saashub.com | 17 May 2024
Index
What are some of the best open-source embedding projects? This list will help you:
Project | Stars | |
---|---|---|
1 | PaddleNLP | 11,515 |
2 | myGPTReader | 4,399 |
3 | awesome-community-detection | 2,274 |
4 | infinity | 1,793 |
5 | modelfusion | 952 |
6 | MyScaleDB | 665 |
7 | Angular-Penalty-Softmax-Losses-Pytorch | 476 |
8 | ChatGptNet | 283 |
9 | word2vec | 187 |
10 | pyTsetlinMachine | 122 |
11 | amazon-denseclus | 90 |
12 | hn-recommendation-api | 79 |
13 | json-repair | 65 |
14 | AnnA_Anki_neuronal_Appendix | 57 |
15 | augmented-interpretable-models | 37 |
16 | GLOM-TensorFlow | 36 |
17 | exembed | 30 |
18 | exn | 11 |
19 | dawnsearch | 10 |
20 | autoembedder | 8 |
21 | SHREC2023-ANIMAR | 6 |
22 | llm-linker | 2 |
Sponsored