CSV or Parquet File Format

This page summarizes the projects mentioned and recommended in the original post on /r/Python.

  1. delta

    An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs (by delta-io)

    I prefer Parquet (or Delta for larger datasets). CSV is for very small datasets, or ones that will later be used or edited in Excel or Google Sheets.

  2. Apache Arrow

    Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

    In fact, I have asked on the Apache Arrow GitHub how to read selected columns from a particular row group of a Parquet file: https://github.com/apache/arrow/issues/35688

  3. DuckDB

    DuckDB is an analytical in-process SQL database management system

    The Parquet-Go library is very complex, and I have not yet managed to use it successfully. So I asked whether DuckDB can provide an API: https://github.com/duckdb/duckdb/issues/7776
