similarity-search-kit vs Elasticsearch
Metric | similarity-search-kit | Elasticsearch
---|---|---
Mentions | 4 | 91
Stars | 278 | 68,069
Growth | - | 1.3%
Activity | 6.5 | 10.0
Last commit | about 1 month ago | 3 days ago
Language | Swift | Java
License | Apache License 2.0 | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
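The description above does not give the exact formula, so the following is only a minimal sketch of how such a score could be computed. It assumes two things that are not in the source: commits are weighted by an exponential decay with a hypothetical 30-day half-life, and the 0 to 10 score is ten times the fraction of tracked projects with a lower weighted total. The function names `weighted_activity` and `activity_scores` are illustrative only.

```python
def weighted_activity(commit_ages_days, half_life_days=30.0):
    """Sum of per-commit weights; a commit's weight halves every half_life_days.

    half_life_days is an assumed parameter, not a value taken from the source.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_scores(raw_by_project):
    """Map raw activity to a 0-10 score from each project's rank among all projects."""
    n = len(raw_by_project)
    scores = {}
    for name, raw in raw_by_project.items():
        # Count the tracked projects that are strictly less active than this one.
        below = sum(1 for other in raw_by_project.values() if other < raw)
        scores[name] = round(10.0 * below / n, 1)
    return scores


if __name__ == "__main__":
    # Hypothetical commit ages (in days) for three made-up projects.
    raw = {
        "busy": weighted_activity([0, 1, 2, 3]),
        "steady": weighted_activity([10, 40, 70]),
        "quiet": weighted_activity([200, 400]),
    }
    print(activity_scores(raw))
```

Under these assumptions, a score of 9.0 means that 90% of the tracked projects have a lower weighted total, which matches the "top 10%" reading given above.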
similarity-search-kit
- Similarity-search-kit: on-device text embeddings and semantic search in Swift
- Show HN: SimilaritySearchKit – A package for local text embeddings with CoreML
- What Is a Vector Database
  How are you guys thinking about the embedding generation side of things? It seems like that part has a generally hefty compute cost before it even gets into the index - I just open sourced a swift package to try to make that part as easy as possible, the example project exports directly to pinecone. https://github.com/ZachNagengast/similarity-search-kit
- I built a knowledge retrieval library in Swift, looking for feedback 🕵️
  Check it out here: https://github.com/ZachNagengast/similarity-search-kit
Elasticsearch
- Elasticsearch Version 9
  You could check out their GitHub and see what is going on https://github.com/elastic/elasticsearch/issues
- One .gitignore to rule them all
- Who's hiring developer advocates? (October 2023)
- Do we think about vector dbs wrong?
  I believe the 1024 limit has been upped in recent versions of Elasticsearch
  https://github.com/elastic/elasticsearch/issues/92458
- Elasticsearch VS openobserve - a user suggested alternative
  2 projects | 30 Aug 2023
- A dedicated Elasticsearch query language (ES|QL)
- Fleet datastreams: custom index templates
- Integrating Elasticsearch with Node.js Applications
  Elasticsearch is written in Java and its source code is available on Github.
- Murmur3 hash plugin for nested objects?
  I don't think the murmur3 hash implementation has changed since it was added as the default in version 2.0 (see the [changes](https://github.com/elastic/elasticsearch/commits/main/server/src/main/java/org/elasticsearch/cluster/routing/Murmur3HashFunction.java)). The plugin itself has seen [more changes](https://github.com/elastic/elasticsearch/commits/main/plugins/mapper-murmur3) but that's IMO because of internals and not visible changes in the calculations.
- Mongo or Mysql for 10tb of JSON documents, I'm questioning my previous choice.
  Mysql is not as open source as postgres (long story). And you can see how open elasticsearch is by just having access to the bugs database https://github.com/elastic/elasticsearch/issue
What are some alternatives?
chroma - the AI-native open-source embedding database
OpenSearch - 🔎 Open source distributed and RESTful search engine.
pgvector - Open-source vector similarity search for Postgres
Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]
Victor - What's our vector, Victor? Victor is a toy vector database written in Go.
bleve - A modern text/numeric/geo-spatial/vector indexing library for go
lucene - Apache Lucene open-source search software
GPT4Memory
Whoosh
faiss - A library for efficient similarity search and clustering of dense vectors.
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow