postgres-word2vec
utils to use word embedding models like word2vec vectors in a PostgreSQL database (by guenthermi)
TurboPFor
Fastest Integer Compression (by powturbo)
postgres-word2vec | TurboPFor | |
---|---|---|
2 | 8 | |
140 | 745 | |
- | - | |
2.6 | 8.5 | |
over 2 years ago | 2 months ago | |
C | C | |
MIT License | GNU General Public License v3.0 only |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
postgres-word2vec
Posts with mentions or reviews of postgres-word2vec.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-09-27.
-
Storing word / document vectors in RDBMS
I've recently stumbled upon smaller projects, like FREDDY (https://github.com/guenthermi/postgres-word2vec), a Postgres extension that looks interesting. The ability to write ad-hoc similarity queries in SQL seems like it might be valuable in some circumstances. I'm not sure about performance or storage efficacy.
-
Build a fuzzy search with PostgreSQL
The syntactic similarity is not enough for you? Word2vec could be an option to compare the semantic similarity of words. Luckily there already is a postgres-word2vec extension for this.
TurboPFor
Posts with mentions or reviews of TurboPFor.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-02-22.
- Show HN: Time Series Benchmark TurboPFor,TurboFloat,TurboFloat LzX,TurboGorilla
-
Show HN: HN:The Gorilla in the Room:Exploring RedisTimeSeries Optimizations
[4] https://github.com/powturbo/TurboPFor-Integer-Compression/is...
-
Float Compression 9: Lzsse and Lizard
The bytedelta described in the blog is suboptimal and does might not work with other datasets.
Download icapp from https://github.com/powturbo/TurboPFor-Integer-Compression/re... and make your own tests with your data.
[1] https://github.com/powturbo/TurboPFor-Integer-Compression
- Show HN: 1D/2D/3D Lossless/Lossy Floating Point Compression with TurboPFor
-
How do Games manage NPC schedules?
I use a fake database paired with compressed bits for flags and integer compression for various other traits. They follow a navigation guide similar to wind for foliage.
-
Quantile Compression: 35% higher compression ratio for numeric sequences than any other compressor
It could be nice to see a comparison against https://github.com/powturbo/TurboPFor-Integer-Compression !
-
q_compress 0.7: still has 35% higher compression ratio than .zstd.parquet for numerical sequences, now with delta encoding and 2x faster than before
I'm the author of TurboPFor-Integer-Compression. Q_compress is a very interresting project, unfortunatelly it's difficult to compare it to other algorithms. There is not binary or test data files (with q_compress results) available for a simple benchmark. Speed comparison would also be helpfull.
-
C Deep
TurboPFor - Fastest integer compression. GPL-2.0-or-later
What are some alternatives?
When comparing postgres-word2vec and TurboPFor you can also consider the following projects:
TimescaleDB - An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.
x3-rust - X3 Lossless Audio Compression for Rust