|1 day ago||6 days ago|
|BSD 3-clause "New" or "Revised" License||MIT License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
[D] Are you seeing any compelling use cases of semantic search being leveraged at scale?
3 projects | reddit.com/r/MachineLearning | 29 Nov 2021
[P] Effects of Metadata filtering with HNSW on Recall and Query time
1 project | reddit.com/r/MachineLearning | 21 Oct 2021
https://github.com/semi-technologies/weaviate claims that they can combine vector search along with symbolic queries using a GraphQL like API.
jina VS Weaviate - a user suggested alternative
2 projects | 1 Sep 2021
Solves a similar problem: Vector search for any media type, but does it with a resilient and highly-scalable architecture.
How an open-source side-project (the Weaviate vector search engine) turned into a full CRUD ANN database
1 project | reddit.com/r/opensource | 30 Aug 2021
Weaviate on Github: https://github.com/semi-technologies/weaviate
How an open-source side-project (Weaviate vector search engine) turned into a full CRUD ANN database
1 project | reddit.com/r/opensource | 30 Aug 2021
Vector Search Indexes
5 projects | news.ycombinator.com | 11 Aug 2021
Weaviate – an open-source vector search engine (/w semantic search and Q&A support)
2 projects | reddit.com/r/LanguageTechnology | 24 Jun 2021
Last but not least, software docs :-)2 projects | reddit.com/r/LanguageTechnology | 24 Jun 2021
We are building a vector database, this is how we do it
1 project | reddit.com/r/Database | 24 Jun 2021
I've posted on this subreddit recently but I wanted to post one more thing. We are working on this new type of database (a vector DB) called Weaviate and in this video, it is explained how Weaviate works under the hood.
We are working on a new type of database: a vector search engine called Weaviate
4 projects | reddit.com/r/Database | 17 Jun 2021
Hi there! I wanted to share something about Weaviate, a new type of OSS database that we are working on, it is a vector database (or, vector search engine to be precise).The core two use cases are semantic search (including question answering as you can see in the image below) and automatic classification. There is an open-source core on Github and you can find the documentation here. Would love to hear your thoughts and feedback!
How should I name my package repository, when there are multiple implementations in other languages in my github account?
2 projects | reddit.com/r/golang | 27 Nov 2021
There is also an option to have a custom domain like go.etcd.io/bbolt
My open source notification Android app and server can now be fully self-hosted
14 projects | reddit.com/r/selfhosted | 19 Nov 2021
Remove sqlite or replace with bbolt
Best way to store logs?
3 projects | reddit.com/r/golang | 29 Oct 2021
I would used some embedded kv store like go-leveldb or bolt. Key is BigEnding timestamp + optional tail to allow duplicate timestamps.
Introducing Weaviate, a fast modular vector search engine with out of the box support for state-of-the-art ML models written in Go
5 projects | reddit.com/r/golang | 12 Apr 2021
We initially went for using bolt (and later bbolt) which is the same store that powers etcd. It uses a B+tree approach. It worked great, but we're not happy with the write performance. We're currently in the process of switching to a custom LSM-tree-based approach, which is what you typically find in DBs with great write performance (e.g. Cassandra).
Scaling Kubernetes with Assurance at Pinterest
3 projects | reddit.com/r/kubernetes | 8 Apr 2021
Debuggability: We also built etcd data storage analysis tool using boltdb library and flamegraph to visualize data storage breakdown. Page cache used by kube-apiserver is counted towards a cgroup’s memory limit, and anonymous memory usage can steal page cache usage for the same cgroup. (force cgroup to reclaim as many pages reclaimed as possible if memory usage is approaching limit.)
Golang web framework for fast development?
6 projects | reddit.com/r/golang | 18 Mar 2021
But how, exactly, databases use mmap?
3 projects | dev.to | 20 Jan 2021
The original bolt repository is no longer maintained. A fork of bolt called bbolt is maintained and used by etcd. If you are not familiar with bolt, I recommend the articles Intro to BoltDB: Painless Performant Persistence and Bolt — an embedded key/value database for Go .
What are some alternatives?
milvus - An open-source vector database for embedding similarity search and AI applications.
jina - Cloud-native neural search framework for 𝙖𝙣𝙮 kind of data
go - The Go programming language
badger - Fast key-value DB in Go.
tiedot - A rudimentary implementation of a basic document (NoSQL) database in Go
goleveldb - LevelDB key/value database in Go.
Bitcask - 🔑 A high performance Key/Value store written in Go with a predictable read/write performance and high throughput. Uses a Bitcask on-disk layout (LSM+WAL) similar to Riak.
faiss - A library for efficient similarity search and clustering of dense vectors.
vald - Vald. A Highly Scalable Distributed Vector Search Engine
cockroach - CockroachDB - the open source, cloud-native distributed SQL database.
buntdb - BuntDB is an embeddable, in-memory key/value database for Go with custom indexing and geospatial support