-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
I filed an issue a few hours ago. https://github.com/motifland/markprompt/issues/5 (for asciidoc)
A lot of similar projects
- https://github.com/microsoft/semantic-kernel/tree/main/sampl...
- https://www.producthunt.com/posts/gitterbot-io-conversationa...
- https://github.com/neuml/txtai/blob/master/examples/03_Build...
- https://github.com/openai/openai-cookbook/blob/main/examples...
A lot of similar projects
- https://github.com/microsoft/semantic-kernel/tree/main/sampl...
- https://www.producthunt.com/posts/gitterbot-io-conversationa...
- https://github.com/neuml/txtai/blob/master/examples/03_Build...
- https://github.com/openai/openai-cookbook/blob/main/examples...
A lot of similar projects
- https://github.com/microsoft/semantic-kernel/tree/main/sampl...
- https://www.producthunt.com/posts/gitterbot-io-conversationa...
- https://github.com/neuml/txtai/blob/master/examples/03_Build...
- https://github.com/openai/openai-cookbook/blob/main/examples...
If you have a small number of fixed documents e.g. <100k or so, then I agree that pickling the vectors or storing them as bytearrays would work better.
Once you reach a certain scale, it's helpful to potentially use distributed querying and/or different index types, even if you have a fairly static dataset. You can check out a billion-scale search benchmark we recently did here: https://zilliz.com/resources/milvus-performance-benchmark (you'll need to supply your email unfortunately). Here's the framework we used as well: https://github.com/zilliztech/vectordb-benchmark
Related posts
-
Show HN: FileKitty – Combine and label text files for LLM prompt contexts
-
SB-1047 will stifle open-source AI and decrease safety
-
Show HN: Plandex – an AI coding engine for complex tasks
-
Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning
-
Rabbit R1, Designed by Teenage Engineering