Our great sponsors
-
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
This is undocumented (frustrating) but it looks like it's chunking them, running embeddings on the chunks and storing the results in a https://qdrant.tech/ vector database.
We know it's Qdrant because an error message leaked that detail: https://twitter.com/altryne/status/1721989500291989585
I wrote a cli wrapper for assistants (GPTs) to make it easier to test out these features https://github.com/HumanAssistedIntelligence/OAICLI
I had some trouble forcing assistants to use the tool {"type": "retrieval"}. However, you can be explicit in your prompts and messages though, and I found it to work quite well.
Related posts
- Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant
- Qdrant 1.8.0 - Major Performance Enhancements
- Perform Image-Driven Reverse Image Search on E-Commerce Sites with ImageBind and Qdrant
- Step-by-Step Guide to Building LLM Applications with Ruby (Using Langchain and Qdrant)
- Qdrant - Using FastEmbed for Rapid Embedding Generation: A Benchmark and Guide