Would ParquetWriter from pyarrow automatically flush?

This page summarizes the projects mentioned and recommended in the original post on /r/learnpython
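
On the title question itself: as far as I know, pyarrow's ParquetWriter only guarantees a complete, readable file once it is closed — each write_table() call emits a row group, but the file footer is written on close(). A minimal sketch, where the file name, schema, and data are purely illustrative:

```python
import pyarrow as pa
import pyarrow.parquet as pq

# Illustrative schema and data (not from the original post)
schema = pa.schema([("id", pa.int64()), ("value", pa.float64())])
batch = pa.table({"id": [1, 2, 3], "value": [0.1, 0.2, 0.3]})

# The context manager guarantees close() runs, which writes the footer
# and finalizes the file; without it the file may be left unreadable.
with pq.ParquetWriter("example.parquet", schema) as writer:
    writer.write_table(batch)  # each call writes a row group
    writer.write_table(batch)
```

If the writer is created without a context manager, calling writer.close() explicitly serves the same purpose.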

  • kudu

    Mirror of Apache Kudu (by apache)

  • hudi

    Upserts, Deletes And Incremental Processing on Big Data.

  • iceberg

    Apache Iceberg

  • https://github.com/apache/iceberg
    https://iceberg.apache.org/

    > Hidden partitioning prevents user mistakes that cause silently incorrect results or extremely slow queries
    > Version rollback allows users to quickly correct problems by resetting tables to a good state
    > Multiple concurrent writers use optimistic concurrency and will retry to ensure that compatible updates succeed, even when writes conflict

  • Dask

    Parallel computing with task scheduling

  • "Support pyarrow.dataset API for read_parquet" https://github.com/dask/dask/pull/6534

