smaz
FiniteStateEntropy
Our great sponsors
smaz | FiniteStateEntropy | |
---|---|---|
3 | 4 | |
1,131 | 1,263 | |
- | - | |
0.0 | 0.0 | |
over 4 years ago | over 1 year ago | |
C | C | |
BSD 3-clause "New" or "Revised" License | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
smaz
-
Advanced MessagePack capabilities
Choose the data compression algorithm based on the specifics of your data. For example, if you are working with lots of short strings, take a look at [*SMAZ](https://github.com/antirez/smaz).*
-
Improving short string compression.
Take a look at this. Idea behind it seems nice, but it's fixed dictionary ("codebook") was clearly made for English language, and the algorithm itself is really simple. How can we impove on this? Dynamic dictionary won't do, since you have to store it somewhere, nullifying benefits of using such algorithm. Beyond that I have no idea.
-
C Deep
smaz - Efficient string compression library. BSD-3-Clause
FiniteStateEntropy
-
Intel QuickAssist Technology Zstandard Plugin for Zstandard
It's obsolete. It's limited to 32KB LZ window with huffman coding. Zstd can use a much larger window (8MB recommended) and a much better entropy coder: https://github.com/Cyan4973/FiniteStateEntropy
-
Worries about tANS?
tANS block based : FSE
-
Silly Lossy Text Compression Idea
Sounds similar to: https://github.com/Cyan4973/FiniteStateEntropy
https://arxiv.org/abs/1311.2540
> The modern data compression is mainly based on two approaches to entropy coding: Huffman (HC) and arithmetic/range coding (AC). The former is much faster, but approximates probabilities with powers of 2, usually leading to relatively low compression rates. The latter uses nearly exact probabilities - easily approaching theoretical compression rate limit (Shannon entropy), but at cost of much larger computational cost.
-
C Deep
FiniteStateEntropy - Two highly efficient compression codecs optimized for modern CPUs. BSD-2-Clause
What are some alternatives?
LZMAT - git mirror of LZMAT (http://www.matcode.com/lzmat.htm)
Snappy - A fast compressor/decompressor
zstd - Zstandard - Fast real-time compression algorithm
doboz
zlib-ng - zlib replacement with optimizations for "next generation" systems.
brotli - Brotli compression format
ZLib - A massively spiffy yet delicately unobtrusive compression library.
LZFSE - LZFSE compression library and command line tool
LZHAM - Lossless data compression codec with LZMA-like ratios but 1.5x-8x faster decompression speed, C/C++
LZMA - (Unofficial) Git mirror of LZMA SDK releases