C++ Big Data

Open-source C++ projects categorized as Big Data | Edit details

Top 4 C++ Big Data Projects

  • GitHub repo ClickHouse

    ClickHouse® is a free analytics DBMS for big data

    Project mention: Grep one-liners as CI tasks | news.ycombinator.com | 2022-01-14
  • GitHub repo kudu

    Mirror of Apache Kudu (by apache)

    Project mention: Would ParquetWriter from pyarrow automatically flush? | reddit.com/r/learnpython | 2021-09-11
  • SonarLint

    Deliver Cleaner and Safer Code - Right in Your IDE of Choice!. SonarLint is a free and open source IDE extension that identifies and catches bugs and vulnerabilities as you code, directly in the IDE. Install from your favorite IDE marketplace today.

  • GitHub repo PGM-index

    🏅State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes

    Project mention: PGM Indexes: Learned indexes that match B-tree performance with 83x less space | news.ycombinator.com | 2021-01-25

    Yep, I'm working on a multidimensional version that I hope to upload to the main repo (https://github.com/gvinciguerra/PGM-index) in a few weeks.

  • GitHub repo nebula

    A distributed block-based data storage and compute engine (by varchar-io)

    Project mention: Streaming multi-file SQL and CSV/TSV/etc., native/WASM and fastest CSV parser | news.ycombinator.com | 2022-01-14

    cool - I also hand crafted a CSV parser following RFC4180 a while ago, not sure if you have a repeatable way to benchmark the performance difference?

    https://github.com/varchar-io/nebula/blob/master/src/storage...

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-01-14.

C++ Big Data related posts

Index

What are some of the best open-source Big Data projects in C++? This list will help you:

Project Stars
1 ClickHouse 21,770
2 kudu 1,525
3 PGM-index 573
4 nebula 98
Find remote jobs at our new job board 99remotejobs.com. There are 29 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
OPS - Build and Run Open Source Unikernels
Quickly and easily build and deploy open source unikernels in tens of seconds. Deploy in any language to any cloud.
github.com/nanovms