Top 23 C Compression Projects

zstd

109 22,581 9.6 C

Zstandard - Fast real-time compression algorithm

Project mention: Rethinking string encoding: a 37.5% space efficient encoding than UTF-8 in Fury | news.ycombinator.com | 2024-05-07

> In such cases, the serialized binary are mostly in 200~1000 bytes. Not big enough for zstd to work
You're not referring to the same dictionary that I am. Look at --train in [1].
If you have a training corpus of representative data, you can generate a dictionary that you preshare on both sides which will perform much better for very small binaries (including 200-1k bytes).
If you want maximum flexibility (i.e. you don't know the universe of representative messages ahead of time or you want maximum compression performance), you can gather this corpus transparently as messages are generated, then generate a dictionary and attach it as sideband metadata to a message. You'll probably need to defer decoding if a message references a dictionary not yet received (i.e. when messages are delivered out of order relative to how they were generated). There are other techniques you can apply, but the general rule is that your custom encoding scheme is unlikely to outperform zstd + a representative training corpus. If it does, you'd need to actually show this rather than try to argue from first principles.
[1] https://github.com/facebook/zstd/blob/dev/programs/zstd.1.md
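
The preshared-dictionary idea in the comment above can be sketched with Python's standard library. This is a minimal illustration, not zstd itself: it uses zlib's preset dictionary (`zdict`) as a stand-in for a dictionary trained with `zstd --train`, and the message layout, field names, and the way the "corpus" is built are all hypothetical.

```python
import json
import zlib
from typing import Optional


def make_message(i: int) -> bytes:
    """Build a small, hypothetical JSON message (well under 1 KB)."""
    record = {
        "user_id": i,
        "event": "page_view",
        "path": f"/products/{i % 7}",
        "user_agent": "Mozilla/5.0 (X11; Linux x86_64)",
        "referrer": "https://example.com/search",
    }
    return json.dumps(record).encode("utf-8")


# Assumption: the "dictionary" is just representative messages concatenated.
# A real zstd dictionary comes from a trainer; this is a cruder stand-in.
shared_dict = b"".join(make_message(i) for i in range(20))


def compress(msg: bytes, zdict: Optional[bytes] = None) -> bytes:
    """Compress one message, optionally primed with the preshared dictionary."""
    if zdict is None:
        comp = zlib.compressobj(level=9)
    else:
        comp = zlib.compressobj(level=9, zdict=zdict)
    return comp.compress(msg) + comp.flush()


def decompress(blob: bytes, zdict: Optional[bytes] = None) -> bytes:
    """Decompress one message; the receiver must hold the same dictionary."""
    if zdict is None:
        decomp = zlib.decompressobj()
    else:
        decomp = zlib.decompressobj(zdict=zdict)
    return decomp.decompress(blob) + decomp.flush()


msg = make_message(1234)  # a message that is not part of the dictionary
plain = compress(msg)
with_dict = compress(msg, shared_dict)

print(len(msg), len(plain), len(with_dict))
```

Both sides must hold the same `shared_dict`. On a message this small, the dictionary-primed output should come out noticeably smaller than the plain one, because the compressor can reference the shared field names instead of spelling them out again.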

LZ4

21 9,312 9.5 C

Extremely Fast Compression algorithm
ZLib

49 5,346 8.8 C

A massively spiffy yet delicately unobtrusive compression library.

Project mention: Zlib 1.3.1 Out | news.ycombinator.com | 2024-01-22

cute_headers

5 4,129 6.0 C

Collection of cross-platform one-file C/C++ libraries with no dependencies, primarily used for games
opus

26 2,142 9.5 C

Modern audio compression for the internet.

Project mention: TSAC: Low Bitrate Audio Compression | news.ycombinator.com | 2024-04-08

Opus doesn't support 44.1 kHz because of compatibility and the effort/benefit ratio:
https://github.com/xiph/opus/issues/43
The browser audio limitation is presumably a workaround to some bug or performance limitation that was relevant at some point in history (the site was created in 2014).

LZFSE

1 1,759 0.0 C

LZFSE compression library and command line tool
cstore_fdw

6 1,738 2.6 C

Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.

Project mention: Moving a Billion Postgres Rows on a $100 Budget | news.ycombinator.com | 2024-02-21

Columnar store PostgreSQL extensions exist; here are two, though I think I’m missing at least one more:
https://github.com/citusdata/cstore_fdw
https://github.com/hydradatabase/hydra
You can also connect other stores using foreign data wrappers, such as parquet files stored on an object store, duckdb, clickhouse… though the joins aren’t optimised, as PostgreSQL would do a full scan on the external table when joining.

zlib-ng

13 1,453 9.3 C

zlib replacement with optimizations for "next generation" systems.

Project mention: Show HN: Pzip- blazing fast concurrent zip archiver and extractor | news.ycombinator.com | 2023-09-24

Please note that allowing for a 2% bigger resulting file could mean a huge speedup in these circumstances, even with the same compression routines; see these benchmarks of zlib and zlib-ng at different compression levels:
https://github.com/zlib-ng/zlib-ng/discussions/871
IMO a fair comparison of the real speed improvement brought by a new program is only possible between almost identical resulting compressed sizes.

zip

2 1,333 7.1 C

A portable, simple zip library written in C (by kuba--)
FiniteStateEntropy

4 1,263 0.0 C

New generation entropy codecs : Finite State Entropy and Huff0

Project mention: Intel QuickAssist Technology Zstandard Plugin for Zstandard | news.ycombinator.com | 2023-08-16

It's obsolete. It's limited to a 32 KB LZ window with Huffman coding. Zstd can use a much larger window (8 MB recommended) and a much better entropy coder: https://github.com/Cyan4973/FiniteStateEntropy
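
The effect of a 32 KB window can be seen in a small experiment. This sketch assumes the comment above is describing DEFLATE (as implemented by zlib). Because zstd does not ship with most Python versions, the standard-library `lzma` module stands in here as "a compressor with a much larger window". The 40,000-byte block size is an arbitrary choice that places the repeat just beyond DEFLATE's reach.

```python
import lzma
import random
import zlib

# Deterministic pseudo-random block: incompressible on its own.
rng = random.Random(0)
block = bytes(rng.getrandbits(8) for _ in range(40_000))

# The second copy starts 40,000 bytes after the first, which is farther
# back than a 32 KB window can look.
data = block + block

deflated = zlib.compress(data, 9)   # DEFLATE: 32 KB window
wide = lzma.compress(data)          # LZMA: multi-megabyte dictionary by default

print(len(data), len(deflated), len(wide))
```

With the repeat out of range, the DEFLATE output stays close to the input size, while the larger-window compressor can encode the second copy as a back-reference and ends up near half the size.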

Minizip-ng

0 1,173 7.9 C

Fork of the popular zip manipulation library found in the zlib distribution.
smaz

3 1,131 0.0 C

Small strings compression library
astc-encoder

1 997 7.9 C

The Arm ASTC Encoder, a compressor for the Adaptive Scalable Texture Compression data format.
c-blosc

1 963 5.1 C

A blocking, shuffling and loss-less compression library that can be faster than `memcpy()`.
lzbench

9 848 1.4 C

lzbench is an in-memory benchmark of open-source LZ77/LZSS/LZMA compressors

Project mention: Chrome Feature: ZSTD Content-Encoding | news.ycombinator.com | 2024-04-01

For a benchmark on a standard set: https://github.com/inikep/lzbench/blob/master/lzbench18_sort...

TurboPFor

8 746 8.5 C

Fastest Integer Compression

Project mention: Show HN: Time Series Benchmark TurboPFor,TurboFloat,TurboFloat LzX,TurboGorilla | news.ycombinator.com | 2023-06-25

p7zip

13 744 0.0 C

A new p7zip fork with additional codecs and improvements (forked from https://sourceforge.net/projects/sevenzip/ AND https://sourceforge.net/projects/p7zip/).
squashfs-tools

2 719 9.3 C

tools to create and extract Squashfs filesystems
bzip3

12 650 6.0 C

A better and stronger spiritual successor to BZip2.

Project mention: Bzip2 Format Specification [pdf] | news.ycombinator.com | 2024-04-10

Kamila Szewczyk is working on bzip3 to improve the state of the art in the domain of compressors based on Burrows-Wheeler:
https://github.com/kspalaiologos/bzip3
I’m keeping fingers crossed for the project. Especially given that the author is 19 and her best work is yet to come.

lizard

4 639 0.0 C

Lizard (formerly LZ5) is an efficient compressor with very fast decompression. It achieves compression ratio that is comparable to zip/zlib and zstd/brotli (at low and medium compression levels) at decompression speed of 1000 MB/s and faster. (by inikep)
zson

1 527 3.0 C

ZSON is a PostgreSQL extension for transparent JSONB compression
simdcomp

1 476 3.5 C

A simple C library for compressing lists of integers using binary packing
gozstd

1 419 0.0 C

go wrapper for zstd

Project mention: Chrome Feature: ZSTD Content-Encoding | news.ycombinator.com | 2024-04-01


NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

C Compression related posts

LuaRT 1.8.0 – open-source Windows programming framework for Lua
2 projects | news.ycombinator.com | 22 May 2024

Ask HN: Why are people so mean in the open source community? (about xz again)
1 project | news.ycombinator.com | 18 May 2024

VDO: Userspace tools for pools of deduplicated and compressed block storage
1 project | news.ycombinator.com | 14 May 2024

Rethinking string encoding: a 37.5% space efficient encoding than UTF-8 in Fury
2 projects | news.ycombinator.com | 7 May 2024

Drink Me: (Ab)Using a LLM to Compress Text
2 projects | news.ycombinator.com | 4 May 2024

FC8 – Faster 68K Decompression (2016)
3 projects | news.ycombinator.com | 29 Apr 2024

SQLite VFS for ZSTD seekable format
2 projects | news.ycombinator.com | 26 Apr 2024

Index

What are some of the best open-source Compression projects in C? This list will help you:

#	Project	Stars
1	zstd	22,581
2	LZ4	9,312
3	ZLib	5,346
4	cute_headers	4,129
5	opus	2,142
6	LZFSE	1,759
7	cstore_fdw	1,738
8	zlib-ng	1,453
9	zip	1,333
10	FiniteStateEntropy	1,263
11	Minizip-ng	1,173
12	smaz	1,131
13	astc-encoder	997
14	c-blosc	963
15	lzbench	848
16	TurboPFor	746
17	p7zip	744
18	squashfs-tools	719
19	bzip3	650
20	lizard	639
21	zson	527
22	simdcomp	476
23	gozstd	419
