simple8b-timeseries-compression VS corpuscompression

Compare simple8b-timeseries-compression vs corpuscompression and see what are their differences.

corpuscompression

Achieve better compression for small objects with a predefined corpus (by spullara)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
simple8b-timeseries-compression corpuscompression
1 2
7 7
- -
10.0 0.0
about 2 years ago over 3 years ago
C++ Java
Apache License 2.0 -
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

simple8b-timeseries-compression

Posts with mentions or reviews of simple8b-timeseries-compression. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-14.
  • Time-Series Compression Algorithms
    7 projects | news.ycombinator.com | 14 May 2022
    One notable omission from this piece is a technique to compress integer time series with both positive and negative values.

    If you naively apply bit-packing using the Simple8b algorithm, you'll find that negative integers are not compressed. This is due to how signed integers are represented in modern computers: negative integers will have their most significant bit set [1].

    Zigzag encoding is a neat transform that circumvents this issue. It works by mapping signed integers to unsigned integers so that numbers with a small absolute value can be encoded using a small number of bits. Put another way, it encodes negative numbers using the least significant bit for sign. [2]

    If you're looking for a quick way to experiment with various time series compression algorithm I highly recommend Daniel Lemire's FastPFor repository [3] (as linked in the article). I've used the Python bindings [4] to quickly evaluate various compression algorithms with great success.

    Finally I'd like to humbly mention my own tiny contribution [5], an adaptation of Lemire's C++ Simple8b implementation (including basic methods for delta & zigzag encoding/decoding).

    I used C++ templates to make the encoding and decoding routines generic over integer bit-width, which expands support up to 64 bit integers, and offers efficient usage with smaller integers (eg 16 bit). I made a couple other minor tweaks including support for arrays up to 2^64 in length, and tweaking the API/method signatures so they can be used in a more functional style. This implementation is slightly simpler to invoke via FFI, and I intend to add examples showing how to compile for usage via JS (WebAssembly), Python, and C#. I threw my code up quickly in order to share with you all, hopefully someone finds it useful. I intend to expand on usage examples/test cases/etc, and am looking forward to any comments or contributions.

    [1] https://en.wikipedia.org/wiki/Signed_number_representation

    [2] https://en.wikipedia.org/wiki/Variable-length_quantity#Zigza...

    [3] https://github.com/lemire/FastPFor

    [4] https://github.com/searchivarius/PyFastPFor

    [5] https://github.com/naturalplasmoid/simple8b-timeseries-compr...

corpuscompression

Posts with mentions or reviews of corpuscompression. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-05-14.
  • Time-Series Compression Algorithms
    7 projects | news.ycombinator.com | 14 May 2022
    I've always liked this kind of thing. I've also done some experiments with automatically memorizing better starting dictionaries for a corpus eras where the data is small:

    https://github.com/spullara/corpuscompression

    Another interesting case is using a compressed stream to indicate anomalies in the data when the compression ratio spikes down like in log analysis.

  • Text Classification by Data Compression
    2 projects | news.ycombinator.com | 8 Jun 2021
    Was going to come here to say that. Played around with this a bit for compressing small fields using a learned dictionary:

    https://github.com/spullara/corpuscompression

What are some alternatives?

When comparing simple8b-timeseries-compression and corpuscompression you can also consider the following projects:

PyFastPFor - Python bindings for the fast integer compression library FastPFor.

simple8b-timeseries-compr

interpolative_coding - A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm.

FastPFor - The FastPFOR C++ library: Fast integer compression

banyan