[D] Is there any way to filter searches by metadata over current vector DBs like Pinecone?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning.

  • formkiq-core

    A full-featured Document Layer for your application, providing the functionality of a flexible document management system, including storage, discovery, processing, and retrieval. Deploys directly into your Amazon Web Services Cloud.

  • That makes sense to me (biased as I am). I wonder whether Milvus (mentioned in another comment) can handle some of this, or whether a dedicated electronic document management system (EDMS) is required. We have created an Open Core EDMS, running on AWS, that could provide the document management functionality: https://github.com/formkiq/formkiq-core

  • DiscoChat

    DiscoChat is a Discord bot that integrates OpenAI's API and a vector database (ChromaDB) for context-aware AI conversations.

  • Chroma supports filtering on metadata. If you attach metadata that records the privilege level required to access the data, or that segments the data in some other way, you can add a where condition to the query so that only documents matching the filter are retrieved. I use this to retrieve semantically similar messages from a given Discord channel and pipe them into the context. https://github.com/tomthefreakmusic/Discochat
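
    The idea in the comment above (restrict by metadata, then rank by similarity) can be sketched in plain Python. This is only an illustration of the concept, not Chroma's implementation or API: the `filtered_search` helper, the record layout, and the toy two-dimensional embeddings are all assumptions made up for the example. In Chroma itself the filter would be passed as the `where` argument of a collection query.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def filtered_search(records, query_vec, where, k=3):
    """Hypothetical helper: keep records whose metadata equals every
    key/value pair in `where`, then return the k most similar ones."""
    candidates = [
        r for r in records
        if all(r["metadata"].get(key) == value for key, value in where.items())
    ]
    ranked = sorted(
        candidates,
        key=lambda r: cosine(r["embedding"], query_vec),
        reverse=True,
    )
    return ranked[:k]


# Toy data: each message carries the channel it came from as metadata.
records = [
    {"id": "m1", "embedding": [1.0, 0.0], "metadata": {"channel": "general"}},
    {"id": "m2", "embedding": [0.9, 0.1], "metadata": {"channel": "private"}},
    {"id": "m3", "embedding": [0.0, 1.0], "metadata": {"channel": "general"}},
]

hits = filtered_search(records, [1.0, 0.0], where={"channel": "general"}, k=2)
print([r["id"] for r in hits])  # ['m1', 'm3']
```

    Here `m2` is close to the query vector but never appears in the results, because its `channel` metadata does not match the filter. The same pattern would apply to a privilege-level field.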
