Vector Databases for Data-Centric AI (Part 2)

This page summarizes the projects mentioned and recommended in the original post on dev.to

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • QDrant-NLP

    QDrant-NLP

  • Just clone the repo QDrant-NLP and run: docker-compose up I would like to increase the number of datasets this can be tried on, either with GPU backed lambda functions or by saving many example datasets to S3. So far I've only made a 6K subset of ag_news available. ag_news · Datasets at Hugging Face This is the code snippet used to generate the embeddings via hugging-face:

  • refinery

    The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

  • Shout out to both Kern.AI (an excellent open-source NLP labelling tool) https://github.com/code-kern-ai/refinery and Voxel51 (an excellent open-source Computer Vision analysis tool) https://github.com/voxel51/fiftyone for being early adopters of the technology in their platforms, but I don't believe either have yet made use of all of the value it can provide.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • fiftyone

    The open-source tool for building high-quality datasets and computer vision models

  • Shout out to both Kern.AI (an excellent open-source NLP labelling tool) https://github.com/code-kern-ai/refinery and Voxel51 (an excellent open-source Computer Vision analysis tool) https://github.com/voxel51/fiftyone for being early adopters of the technology in their platforms, but I don't believe either have yet made use of all of the value it can provide.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • My Favorite DevTools to Build AI/ML Applications!

    9 projects | dev.to | 23 Apr 2024
  • Creating a Sales Analysis Application with Streamlit: A Practical Approach to Business Intelligence

    1 project | dev.to | 19 Apr 2024
  • 🦙 Llama-2-GGML-CSV-Chatbot 🤖

    3 projects | dev.to | 10 Apr 2024
  • Show HN: Buefy Web Components for Streamlit

    2 projects | news.ycombinator.com | 4 Mar 2024
  • Simplify Web App Development: Code Lite, Create Big!

    1 project | dev.to | 26 Feb 2024