C++ Arrow

Open-source C++ projects categorized as Arrow

Top 6 C++ Arrow Projects

  1. Apache Arrow

    Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

    Project mention: Show HN: Aiopandas – Async .apply() and .map() for Pandas, Faster API/LLMs Calls | news.ycombinator.com | 2025-03-15

    https://github.com/apache/arrow/blob/main/python/pyarrow/tes...

    pyarrow/src/arrow/python/async.h:

  3. cudf

    cuDF - GPU DataFrame Library

    Project mention: Unleashing GPU Power: Supercharge Your Data Processing with cuDF | dev.to | 2024-06-21

    cuDF Documentation

  4. ustore

    Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

  5. duckdb-airport-extension

    The Airport extension for DuckDB enables the use of Arrow Flight with DuckDB.

    Project mention: The DuckDB Local UI | news.ycombinator.com | 2025-03-12

    That's one way of looking at it. To me this UI seems like both a useful tool and an advertisement.

    There's another way this could have gone. DuckDB Labs might have published the extension as providing an official HTTP API for all to use. Then simultaneously MotherDuck would announce support for it in their UI, now with access to any and all databases, whether in-browser, anywhere through the official HTTP API, or in their managed cloud service.

    I for one would like an HTTP API for some things that now necessitate writing my own in Python. I don't yet see much need for the UI. I'm not looking for a public, multiuser service. Just something that I can use locally which doesn't have to be inside a process (such as Python or a web browser). There's such an API in the extension now, but it's without docs and in C++ [1]. There's also the option of using a 3rd party community extension that also does an HTTP API [2]. Then there's one that supports remote access with Arrow Flight, but gRPC only it seems [3]. But an official, stable version would be nice.

    [1] https://github.com/duckdb/duckdb-ui/blob/main/src/http_serve...

    [2] https://duckdb.org/community_extensions/extensions/httpserve...

    [3] https://github.com/Query-farm/duckdb-airport-extension

  6. vinum

    Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.

  7. pgeon

    Apache Arrow PostgreSQL connector

    Project mention: Pgeon: Apache Arrow PostgreSQL connector in C++ | news.ycombinator.com | 2024-08-11
NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

C++ Arrow related posts

  • Adding concurrent read/write to DuckDB with Arrow Flight

    2 projects | news.ycombinator.com | 29 Jan 2025
  • Unlocking DuckDB from Anywhere - A Guide to Remote Access with Apache Arrow and Flight RPC (gRPC)

    4 projects | dev.to | 12 Dec 2024
  • Kotlin DataFrame ❤️ Arrow

    3 projects | dev.to | 10 Oct 2024
  • Random access string compression with FSST and Rust

    3 projects | news.ycombinator.com | 12 Sep 2024
  • Unleashing GPU Power: Supercharge Your Data Processing with cuDF

    1 project | dev.to | 21 Jun 2024
  • The Simdjson Library

    4 projects | news.ycombinator.com | 3 Jun 2024
  • cuDF – GPU DataFrame Library

    2 projects | news.ycombinator.com | 2 Jun 2024

Index

What are some of the best open-source Arrow projects in C++? This list will help you:

#  Project                    Stars
1  Apache Arrow               15,388
2  cudf                       8,917
3  ustore                     573
4  duckdb-airport-extension   194
5  vinum                      65
6  pgeon                      59

Did you know that C++ is the 7th most popular programming language based on number of references?