interpolative_coding
banyan
interpolative_coding | banyan | |
---|---|---|
1 | 1 | |
27 | 36 | |
- | - | |
0.0 | 0.0 | |
over 1 year ago | about 1 month ago | |
C++ | Rust | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
interpolative_coding
-
Time-Series Compression Algorithms
I didn't see binary interpolative coding (BIC) referenced. It is one of my favorites introduced to me by the book "Managing Gigabytes" by Moffett and Bell [1]. It has great compression ratio for sequences and is commonly used in inverted indexes.
There is neat implementation [2] and technical paper [3] by Giulio Ermanno Pibiri, which I just found today by looking for it.
[1] https://people.eng.unimelb.edu.au/ammoffat/mg/
[2] https://github.com/jermp/interpolative_coding
[3] http://pages.di.unipi.it/pibiri/papers/BIC.pdf
banyan
-
Time-Series Compression Algorithms
I have found that a very good approach is to apply some very simple transformations such as delta encoding of timestamps, and then letting a good standard compression algorithm such as zstd or deflate take care of the rest.
Delta encoding of timestamps helps a lot though, because it makes the redundancy more visible to a general purpose compression algorithm.
I used this for the telemetry storage of the Columbus module of the international space station, back in ~2010, and then a few times since.
http://blog.klaehn.org/2018/06/10/efficient-telemetry-storag...
https://github.com/Actyx/banyan
What are some alternatives?
simple8b-timeseries-compression
simple8b-timeseries-compr
PyFastPFor - Python bindings for the fast integer compression library FastPFor.
FastPFor - The FastPFOR C++ library: Fast integer compression