C++ Data Analysis

Open-source C++ projects categorized as Data Analysis

Top 17 C++ Data Analysis Projects

Data Analysis
  1. cudf

    cuDF - GPU DataFrame Library

    Project mention: Unleashing GPU Power: Supercharge Your Data Processing with cuDF | dev.to | 2024-06-21

    cuDF Documentation

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. matplotplusplus

    Matplot++: A C++ Graphics Library for Data Visualization 📊🗾

  4. root

    The official repository for ROOT: analyzing, storing and visualizing big data, scientifically

    Project mention: ICPP – Running C++ in anywhere like a script | news.ycombinator.com | 2024-08-03

    Folks who like this kind of thing should definitely check out CERN's Root framework. I've been using its C++ interpreter in a Jupyter notebook environment to learn C++. It's probably also quite a bit more mature than this project. https://root.cern/

  5. DataFrame

    C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage

  6. TileDB

    The Universal Storage Engine

    Project mention: Ask HN: Who is hiring? (February 2025) | news.ycombinator.com | 2025-02-03

    TileDB, Inc. | Full-time | REMOTE | USA, Greece | https://tiledb.com/

    TileDB is the database designed for discovery, built to organize, structure, and analyze any data. Our solutions for single-cell and population genomics are used by major pharmaceutical companies and research institutes, and power large public data collections such as the Cellxgene Discover Census. We are actively hiring for several roles building our unified data catalog, scalable computation, and interactive analysis platform.

    - Infrastructure Engineer: Kubernetes, Terraform, Argo, Grafana, Prometheus, CloudWatch, GitOps; Golang, Python, C++, or Rust (GMT -8/+4).

    - Frontend/UI developer: Typescript, React; experience with high-performance/high-volume data and visualization applications. GMT -8/+1

    We are fully-remote, with optional co-working hubs in Cambridge, MA, New York, NY, and Athens, Greece. Apply today at https://ats.rippling.com/tiledb-careers/jobs or reach out directly (email in profile).

  7. ArcticDB

    ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.

    Project mention: ArcticDB: High performance, serverless DataFrame database | news.ycombinator.com | 2024-09-06
  8. datatable

    A Python package for manipulating 2-dimensional tabular data structures

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. oneDAL

    oneAPI Data Analytics Library (oneDAL)

  11. gdl

    GDL - GNU Data Language

  12. AlphaPlot

    :chart_with_upwards_trend: Application for statistical analysis and data visualization which can generate different types of publication quality 2D and 3D plots with extensive visual customization.

  13. Graphia

    A visualisation tool for the creation and analysis of graphs

  14. volbx

    Graphical tool for data manipulation written in C++/Qt.

  15. nebula

    A distributed block-based data storage and compute engine (by varchar-io)

  16. vinum

    Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.

  17. MachineLearning

    From linear regression towards neural networks... (by aromanro)

  18. vif

    Easy, robust, and fast numerics in C++. (by cschreib)

  19. lesser_pandas

    Data Analysis library in C++

    Project mention: Show HN: Lesser Pandas – Data Analysis Library in C++ | news.ycombinator.com | 2025-05-22
  20. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ Data Analysis discussion

Log in or Post with

C++ Data Analysis related posts

Index

What are some of the best open-source Data Analysis projects in C++? This list will help you:

# Project Stars
1 cudf 8,978
2 matplotplusplus 4,604
3 root 2,871
4 DataFrame 2,711
5 TileDB 1,946
6 ArcticDB 1,934
7 datatable 1,865
8 oneDAL 637
9 gdl 292
10 AlphaPlot 257
11 Graphia 248
12 volbx 246
13 nebula 154
14 vinum 65
15 MachineLearning 25
16 vif 11
17 lesser_pandas 8

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com