PyFastPFor vs interpolative_coding

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

PyFastPFor		interpolative_coding
	Project
2	Mentions	1
56	Stars	27
-	Growth	-
4.6	Activity	0.0
7 months ago	Latest Commit	over 1 year ago
C++	Language	C++
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

PyFastPFor

Posts with mentions or reviews of PyFastPFor. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-14.

Time-Series Compression Algorithms
7 projects | news.ycombinator.com | 14 May 2022

One notable omission from this piece is a technique to compress integer time series with both positive and negative values.
If you naively apply bit-packing using the Simple8b algorithm, you'll find that negative integers are not compressed. This is due to how signed integers are represented in modern computers: negative integers will have their most significant bit set [1].
Zigzag encoding is a neat transform that circumvents this issue. It works by mapping signed integers to unsigned integers so that numbers with a small absolute value can be encoded using a small number of bits. Put another way, it encodes negative numbers using the least significant bit for sign. [2]
If you're looking for a quick way to experiment with various time series compression algorithm I highly recommend Daniel Lemire's FastPFor repository [3] (as linked in the article). I've used the Python bindings [4] to quickly evaluate various compression algorithms with great success.
Finally I'd like to humbly mention my own tiny contribution [5], an adaptation of Lemire's C++ Simple8b implementation (including basic methods for delta & zigzag encoding/decoding).
I used C++ templates to make the encoding and decoding routines generic over integer bit-width, which expands support up to 64 bit integers, and offers efficient usage with smaller integers (eg 16 bit). I made a couple other minor tweaks including support for arrays up to 2^64 in length, and tweaking the API/method signatures so they can be used in a more functional style. This implementation is slightly simpler to invoke via FFI, and I intend to add examples showing how to compile for usage via JS (WebAssembly), Python, and C#. I threw my code up quickly in order to share with you all, hopefully someone finds it useful. I intend to expand on usage examples/test cases/etc, and am looking forward to any comments or contributions.
[1] https://en.wikipedia.org/wiki/Signed_number_representation
[2] https://en.wikipedia.org/wiki/Variable-length_quantity#Zigza...
[3] https://github.com/lemire/FastPFor
[4] https://github.com/searchivarius/PyFastPFor
[5] https://github.com/naturalplasmoid/simple8b-timeseries-compr...
The big-load anti-pattern
2 projects | news.ycombinator.com | 21 Aug 2021

interpolative_coding

Posts with mentions or reviews of interpolative_coding. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-14.

Time-Series Compression Algorithms
7 projects | news.ycombinator.com | 14 May 2022

I didn't see binary interpolative coding (BIC) referenced. It is one of my favorites introduced to me by the book "Managing Gigabytes" by Moffett and Bell [1]. It has great compression ratio for sequences and is commonly used in inverted indexes.
There is neat implementation [2] and technical paper [3] by Giulio Ermanno Pibiri, which I just found today by looking for it.
[1] https://people.eng.unimelb.edu.au/ammoffat/mg/
[2] https://github.com/jermp/interpolative_coding
[3] http://pages.di.unipi.it/pibiri/papers/BIC.pdf

What are some alternatives?

When comparing PyFastPFor and interpolative_coding you can also consider the following projects:

apultra - Free open-source compressor for apLib with 5-7% better ratios

simple8b-timeseries-compression

simple8b-timeseries-compr

lib7zip - c++ library wrapper of 7zip

FastPFor - The FastPFOR C++ library: Fast integer compression

banyan

PyFastPFor vs apultra interpolative_coding vs simple8b-timeseries-compression PyFastPFor vs simple8b-timeseries-compression interpolative_coding vs simple8b-timeseries-compr PyFastPFor vs lib7zip interpolative_coding vs FastPFor PyFastPFor vs simple8b-timeseries-compr PyFastPFor vs banyan

Compare PyFastPFor vs interpolative_coding and see what are their differences.

PyFastPFor

interpolative_coding

PyFastPFor

interpolative_coding

What are some alternatives?