Top 23 Python Compression Projects

DeepSpeed

51 32,550 9.8 Python

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Project mention: Can we discuss MLOps, Deployment, Optimizations, and Speed? | /r/LocalLLaMA | 2023-12-06

DeepSpeed can handle parallelism concerns, and even offload data/model to RAM, or even NVMe (!?) . I'm surprised I don't see this project used more.

PaddleNLP

2 11,423 9.8 Python

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
BorgBackup

333 10,526 9.4 Python

Deduplicating archiver with compression and authenticated encryption.

Project mention: Ask HN: Open-source Windows 11 backup solutions | news.ycombinator.com | 2024-04-04

i use - and recommend - "borgbackup": for example with the "vorta" graphical frontend
* https://www.borgbackup.org/
* https://vorta.borgbase.com/install/windows/
just my 0.02€

Crunch

1 3,325 10.0 Python

Insane(ly slow but wicked good) PNG image optimization (by chrissimpkins)
unblob

16 2,046 9.5 Python

Extract files from any kind of container formats

Project mention: Reverse-engineering an encrypted IoT protocol | news.ycombinator.com | 2024-02-14

aimet

2 1,900 9.6 Python

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
borgmatic

61 1,639 9.5 Python

Simple, configuration-driven backup software for servers and workstations

Project mention: Rclone syncs your files to cloud storage | news.ycombinator.com | 2024-01-26

- for important files, a separate box where I have borgmatic [1] in deduplication mode installed; this is updated once in a while
Just curious: Do you have any reason to believe that such a data corruption bug is likely in ZFS? It seems like saying that ext4 could have a bug and you should also store stuff on NTFS, just in case (which I think does not make sense..).
[1]: https://github.com/borgmatic-collective/borgmatic

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
model-optimization

1 1,465 6.7 Python

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
compression

1 823 6.6 Python

Data compression in TensorFlow (by tensorflow)
nncf

2 777 9.7 Python

Neural Network Compression Framework for enhanced OpenVINO™ inference

Project mention: FLaNK Stack Weekly 06 Nov 2023 | dev.to | 2023-11-06

Awesome-Efficient-LLM

1 775 9.4 Python

A curated list for Efficient Large Language Models

Project mention: A Curated List for Efficient Large Language Models | news.ycombinator.com | 2023-07-15

ratarmount

10 628 8.6 Python

Access large archives as a filesystem efficiently, e.g., TAR, RAR, ZIP, GZ, BZ2, XZ, ZSTD archives

Project mention: Ratarmount: Access large archives as a filesystem efficiently | news.ycombinator.com | 2024-04-10

refinery

2 588 9.7 Python

High Octane Triage Analysis (by binref)
swin2sr

2 526 2.6 Python

Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 3.3M runs https://replicate.com/mv-lab/swin2sr
zipfly

11 518 4.0 Python

Python Zip Stream
pythonlibs

0 341 3.7 Python

A Python wrapper for the extremely fast Blosc compression library
3d-model-convert-to-gltf

1 231 0.0 Python

Convert 3d model (STL/IGES/STEP/OBJ/FBX) to gltf and compression
DictDataBase

9 220 7.8 Python

A python NoSQL dictionary database, with concurrent access and ACID compliance
SecretPixel

2 187 7.0 Python

SecretPixel is a cutting-edge steganography tool designed to securely conceal sensitive information within images. It stands out in the realm of digital steganography by combining advanced encryption, compression, and a seeded Least Significant Bit (LSB) technique to provide a robust solution for embedding data undetectably.

Project mention: Show HN: SecretPixel – Advanced Image Steganography Tool | news.ycombinator.com | 2024-02-03

I've tested Binwalk on all the example files, and the BMP and TGA samples didn't show any zlib compressed data (https://i.ibb.co/R4WWsV2/2024-02-04-16h22-31.png). Can you please confirm that you have used the files from the example folder (https://github.com/x011/SecretPixel/tree/main/examples)? I will try to work on the PNG and TIFF cases soon. Thanks for the paper :)

georaptor

1 187 4.7 Python

Python Geohash Compression Tool

Project mention: Show HN: Georaptor – A Python Geohash Compression Tool v3.0.0 | news.ycombinator.com | 2024-04-03

KVQuant

1 183 5.3 Python

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Project mention: 10M Tokens LLM Context | news.ycombinator.com | 2024-02-02

archive-pdf-tools

3 80 5.7 Python

Fast PDF generation and compression. Deals with millions of pages daily.

Project mention: How to Entertain with Your Pocket Calculator (1975) | news.ycombinator.com | 2023-05-14

(author of the PDF generation sw here, thanks to jrochkind1 for pointing me in this direction) Would you mind filing a bug here with your findings, so that we can figure out what is going on? https://github.com/internetarchive/archive-pdf-tools
We (archive.org) tested the software extensively with the various readers, and we haven't found problems with the PDFs on Apple Preview.

reCBZ

7 60 0.0 Python

Utility for converting and optimizing comics for e-readers & mobile devices

Project mention: Looking for Epub files of full manga series/volumes | /r/mangapiracy | 2023-06-08

SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Compression related posts

Ratarmount: Access large archives as a filesystem efficiently
1 project | news.ycombinator.com | 10 Apr 2024
TSAC: Low Bitrate Audio Compression
4 projects | news.ycombinator.com | 8 Apr 2024
Show HN: SecretPixel – Advanced Image Steganography Tool
3 projects | news.ycombinator.com | 3 Feb 2024
What do you use for VPS backup? Would improved borg setup - pull mode - be enough? Or, do you use something else?
1 project | /r/selfhosted | 5 Dec 2023
Borg CVE fix requires migration
1 project | news.ycombinator.com | 10 Oct 2023
disc space is not freeing
1 project | /r/openSUSE | 25 Jun 2023
I installed Arch today!
1 project | /r/linux4noobs | 12 Jun 2023
A note from our sponsor - WorkOS
workos.com | 26 Apr 2024

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →

Index

What are some of the best open-source Compression projects in Python? This list will help you:

	Project	Stars
1	DeepSpeed	32,550
2	PaddleNLP	11,423
3	BorgBackup	10,526
4	Crunch	3,325
5	unblob	2,046
6	aimet	1,900
7	borgmatic	1,639
8	model-optimization	1,465
9	compression	823
10	nncf	777
11	Awesome-Efficient-LLM	775
12	ratarmount	628
13	refinery	588
14	swin2sr	526
15	zipfly	518
16	pythonlibs	341
17	3d-model-convert-to-gltf	231
18	DictDataBase	220
19	SecretPixel	187
20	georaptor	187
21	KVQuant	183
22	archive-pdf-tools	80
23	reCBZ	60