chroma vs pgvector

chroma

the AI-native open-source embedding database (by chroma-core)

Source Code

trychroma.com

Suggest alternative

Edit details

pgvector

Open-source vector similarity search for Postgres (by pgvector)

nearest-neighbor-search approximate-nearest-neighbor-search

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

chroma		pgvector
	Project
32	Mentions	78
12,324	Stars	9,349
5.5%	Growth	7.0%
9.8	Activity	9.9
6 days ago	Latest Commit	3 days ago
Python	Language	C
Apache License 2.0	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

chroma

Posts with mentions or reviews of chroma. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-23.

Let’s build AI-tools with the help of AI and Typescript!
5 projects | dev.to | 23 Apr 2024

Package installer for Python (pip), we use this for installing the Python-based packages, such as Jupyter Lab, and we're going to use this for installing other Python-based tools like the Chroma DB vector database
Mixtral 8x22B
4 projects | news.ycombinator.com | 17 Apr 2024

Optional: You can use SillyTavern[1] for a more "rich" chat experience
The above lets me chat, at least superficially, with my friend. It's nice for simple interactions and banter; I've found it to be a positive and reflective experience.
0. https://www.trychroma.com/
7 Vector Databases Every Developer Should Know!
4 projects | dev.to | 8 Feb 2024

Chroma DB is a newer entrant in the vector database arena, designed specifically for handling high-dimensional color vectors. It's particularly useful for applications in digital media, e-commerce, and content discovery, where color similarity plays a crucial role in search and recommendation algorithms.
AI Grant Traction in OSS Startups
5 projects | dev.to | 1 Feb 2024

View on GitHub
Qdrant, the Vector Search Database, raised $28M in a Series A round
8 projects | news.ycombinator.com | 23 Jan 2024
Vector Databases: A Technical Primer [pdf]
7 projects | news.ycombinator.com | 12 Jan 2024

For Python I believe Chroma [1] can be used embedded.
For Go I recently started building chromem-go, inspired by the Chroma interface: https://github.com/philippgille/chromem-go
It's neither advanced nor for scale yet, but the RAG demo works.
[1] https://github.com/chroma-core/chroma
Chroma – the open-source embedding database
1 project | news.ycombinator.com | 11 Jan 2024
Show HN: Embeddings Solution for Personal Journal
2 projects | news.ycombinator.com | 1 Nov 2023

The formatting is a bit off.
The web app is here: https://jumblejournal.org
The DB used is here: https://www.trychroma.com/
SQLite vs. Chroma: A Comparative Analysis for Managing Vector Embeddings
2 projects | dev.to | 7 Oct 2023

Whether you’re navigating through well-known options like SQLite, enriched with the sqlite-vss extension, or exploring other avenues like Chroma, an open-source vector database, selecting the right tool is paramount. This article compares these two choices, guiding you through the pros and cons of each, helping you choose the right tool for storing and querying vector embeddings for your project.
How to use Chroma to store and query vector embeddings
3 projects | dev.to | 2 Oct 2023

Create a new project directory for our example project. Next, we need to clone the Chroma repository to get started. At the root of your project directory let's clone Chroma into it:

pgvector

Posts with mentions or reviews of pgvector. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-25.

Integrate txtai with Postgres
2 projects | dev.to | 25 Apr 2024

# Install Postgres and pgvector !apt-get update && apt install postgresql postgresql-server-dev-14 !git clone --branch v0.6.2 https://github.com/pgvector/pgvector.git !cd pgvector && make && make install # Start database !service postgresql start !sudo -u postgres psql -U postgres -c "ALTER USER postgres PASSWORD 'pass';"
Vector Database solutions on AWS
1 project | dev.to | 28 Mar 2024

When talking about Vector Databases, in the market we can find the specialized ones and multi-model, most of the major database providers like Oracle, PostgreSQL or MongoDB, for mention some of them, have integrated a specific solution to retrieve vector data.
Using pgvector To Locate Similarities In Enterprise Data
2 projects | dev.to | 21 Mar 2024

For this example, I wanted to focus on how pgvector – an open-source vector similarity search for Postgres – can be used to identify data similarities that exist in enterprise data.
pgvector vs. pgvecto.rs in 2024: A Comprehensive Comparison for Vector Search in PostgreSQL
1 project | dev.to | 19 Mar 2024

pgvector supports dense vector search well, but it does not have plan to support sparse vector.
Pg_vectorize: The simplest way to do vector search and RAG on Postgres
6 projects | news.ycombinator.com | 6 Mar 2024

There's an issue in the pgvector repo about someone having several ~10-20million row tables and getting acceptable performance with the right hardware and some performance tuning: https://github.com/pgvector/pgvector/issues/455
I'm in the early stages of evaluating pgvector myself. but having used pinecone I currently am liking pgvector better because of it being open source. The indexing algorithm is clear, one can understand and modify the parameters. Furthermore the database is postgresql, not a proprietary document store. When the other data in the problem is stored relationally, it is very convenient to have the vectors stored like this as well. And postgresql has good observability and metrics. I think when it comes to flexibility for specialized applications, pgvector seems like the clear winner. But I can definitely see pinecone's appeal if vector search is not a core component of the problem/business, as it is very easy to use and scales very easily
FLaNK 04 March 2024
26 projects | dev.to | 4 Mar 2024
Vector Database and Spring IA
2 projects | dev.to | 11 Feb 2024

The Spring AI project aims to streamline the development of applications that incorporate artificial intelligence functionality without unnecessary complexity. On this example we use features like: Embedding, Prompts, ETL and save all embedding on PGvector(Postgres Vector database)
Use pgvector for searching images on Azure Cosmos DB for PostgreSQL
2 projects | dev.to | 7 Feb 2024

Official GitHub repository of the pgvector extension
pgvector 0.6.0: 30x faster with parallel index builds
1 project | dev.to | 31 Jan 2024

pgvector 0.6.0 was just released and will be available on Supabase projects soon. Again, a special shout out to Andrew Kane and everyone else who worked on parallel index builds.
Store embeddings in Azure Cosmos DB for PostgreSQL with pgvector
2 projects | dev.to | 29 Jan 2024

The pgvector extension adds vector similarity search capabilities to your PostgreSQL database. To use the extension, you have to first create it in your database. You can install the extension, by connecting to your database and running the CREATE EXTENSION command from the psql command prompt:

What are some alternatives?

When comparing chroma and pgvector you can also consider the following projects:

SillyTavern - LLM Frontend for Power Users.

Milvus - A cloud-native vector database, storage for next generation AI applications

faiss - A library for efficient similarity search and clustering of dense vectors.

golang-ical - A ICS / ICal parser and serialiser for Golang.

Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

AutoGPT - AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Elasticsearch - Free and Open, Distributed, RESTful Search Engine

qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

SillyTavern - LLM Frontend for Power Users. [Moved to: https://github.com/SillyTavern/SillyTavern]

ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python