C Deduplication

Open-source C projects categorized as Deduplication

Top 6 C Deduplication Projects

Deduplication
  1. libpostal

    A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

  2. InfluxDB

    InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.

    InfluxDB logo
  3. rmlint

    Extremely fast tool to remove duplicates and other lint from your filesystem

    Project mention: Hyperspace | news.ycombinator.com | 2025-02-25

    See the comments on https://news.ycombinator.com/item?id=38113396 for a list of alternatives. I used https://github.com/sahib/rmlint in the past and can't complain.

  4. kvdo

    A kernel module which provide a pool of deduplicated and/or compressed block storage.

  5. vdo

    Userspace tools for managing VDO volumes.

    Project mention: VDO: Userspace tools for pools of deduplicated and compressed block storage | news.ycombinator.com | 2024-05-14
  6. dupd

    CLI utility to find duplicate files

  7. swuniq

    A command-line tool for deduplicating entries in a file or stream with constant memory usage

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C Deduplication discussion

Log in or Post with

C Deduplication related posts

Index

What are some of the best open-source Deduplication projects in C? This list will help you:

# Project Stars
1 libpostal 4,223
2 rmlint 2,069
3 kvdo 243
4 vdo 196
5 dupd 114
6 swuniq 5

Sponsored
InfluxDB high-performance time series database
Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
influxdata.com

Did you know that C is
the 6th most popular programming language
based on number of references?