C Deduplication

Open-source C projects categorized as Deduplication

Top 6 C Deduplication Projects

  • libpostal

    A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

  • rmlint

    Extremely fast tool to remove duplicates and other lint from your filesystem

  • Project mention: fdupes: Identify or Delete Duplicate Files | news.ycombinator.com | 2023-11-02

    My preferred solution is rmlint [https://github.com/sahib/rmlint] mostly because it also looks at duplicate directories. It produces a bash script instead of deleting anything itself, so you can examine it before running the script it made.

  • kvdo

    A kernel module which provides a pool of deduplicated and/or compressed block storage.

  • vdo

    Userspace tools for managing VDO volumes.

  • dupd

    CLI utility to find duplicate files

  • swuniq

    A command-line tool for deduplicating entries in a file or stream with constant memory usage

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-11-02.

What are some of the best open-source Deduplication projects in C? This list will help you:

#  Project    Stars
1  libpostal  3,943
2  rmlint     1,768
3  kvdo       236
4  vdo        188
5  dupd       109
6  swuniq     5
