postgres-elasticsearch-fdw
bayard
Our great sponsors
postgres-elasticsearch-fdw | bayard | |
---|---|---|
3 | 4 | |
106 | 1,839 | |
- | 0.1% | |
4.2 | 0.0 | |
29 days ago | about 1 year ago | |
Python | Rust | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
postgres-elasticsearch-fdw
- Full-text search engine with PostgreSQL (part 2): Postgres vs. Elasticsearch
-
Postgres Full Text Search vs. the Rest
My experience with Postgres FTS (did a comparison with Elastic a couple years back), is that filtering works fine and is speedy enough, but ranking crumbles when the resulting set is large.
If you have a large-ish data set with lots of similar data (4M addresses and location names was the test case), Postgres FTS just doesn't perform.
There is no index that helps scoring results. You would have to install an extension like RUM index (https://github.com/postgrespro/rum) to improve this, which may or may not be an option (often not if you use managed databases).
If you want a best of both worlds, one could investigate this extensions (again, often not an option for managed databases): https://github.com/matthewfranglen/postgres-elasticsearch-fd...
Either way, writing something that indexes your postgres database into elastic/opensearch is a one time investment that usually pays off in the long run.
-
Lesser Known PostgreSQL Features
I used a foreign data wrapper to query elasticsearch indexes from within postgres.[0]
It pushed alot of complexity down away from higher-level app developers not familiar with ES patterns.
[0]: https://github.com/matthewfranglen/postgres-elasticsearch-fd...
bayard
-
An alternative to Elasticsearch that runs on a few MBs of RAM
Somewhat related, this guy: https://github.com/mosuka/ seems to be very passionate about search service.
He built two distributed search services:
- https://github.com/mosuka/phalanx, written in Go.
- https://github.com/mosuka/bayard, written in Rust.
-
Postgres Full Text Search vs. the Rest
I wish we had an extension like ZomboDB but using a lighter search engine like https://github.com/quickwit-oss/quickwit, https://github.com/toshi-search/Toshi and https://github.com/mosuka/bayard
Here I'm listing engines based on https://github.com/quickwit-oss/tantivy - tantivy is comparable to Lucene in its scope - but I'm sure there are other engines that could tackle ElasticSearch.
Another thing that could happen is maybe directly embed tantivy in Postgres using an extension, perhaps this could be an option too.
-
Meilisearch, the Rust search engine, just raised $5M
So there's more than one? The one I knew was https://github.com/quickwit-oss/tantivy and https://github.com/quickwit-oss/quickwit on top of it (there's a couple of other search engines built on top of tantivy, like https://github.com/bayard-search/bayard)
What are some alternatives?
rum - RUM access method - inverted index with additional information in posting lists
quickwit - Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
tbls - tbls is a CI-Friendly tool for document a database, written in Go.
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
pg-ulid - ULID Functions for PostgreSQL
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
pgvector - Open-source vector similarity search for Postgres
zincsearch - ZincSearch . A lightweight alternative to elasticsearch that requires minimal resources, written in Go.
postgres-elasticsearch-fd
quickwit - Quickwit is a fast and cost-efficient distributed search engine for large-scale, immutable data. [Moved to: https://github.com/quickwit-oss/quickwit]
js-id - ID generation for JavaScript & TypeScript Applications
lyra - 🌌 Fast, in-memory, typo-tolerant, full-text search engine written in TypeScript. [Moved to: https://github.com/LyraSearch/lyra]