Parquet-format Alternatives

Similar projects and alternatives to parquet-format

polars

144 26,218 10.0 Rust parquet-format VS polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust
mdBook

100 16,669 8.6 Rust parquet-format VS mdBook

Create book from markdown files. Like Gitbook but implemented in Rust
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
FLiPStackWeekly

79 14 9.9 parquet-format VS FLiPStackWeekly

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
orbstack

36 4,354 6.2 Shell parquet-format VS orbstack

Fast, light, simple Docker containers & Linux machines for macOS
graphic-walker

20 2,235 9.4 TypeScript parquet-format VS graphic-walker

An open source alternative to Tableau. Embeddable visual analytic
rapidgzip

14 317 9.5 C++ parquet-format VS rapidgzip

Gzip Decompression and Random Access for Modern Multi-Core Machines
generative-models

21 22,196 7.6 Python parquet-format VS generative-models

Generative Models by Stability AI
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
configu

17 1,490 9.1 TypeScript parquet-format VS configu

a simple, modern, and secure standard for managing and collaborating software configurations ⚙️✨.
fast_float

15 1,277 8.7 C++ parquet-format VS fast_float

Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12 and WebKit/Safari
gping

13 10,289 8.5 Rust parquet-format VS gping

Ping, but with a graph
background-removal-js

9 5,279 8.0 TypeScript parquet-format VS background-removal-js

Remove backgrounds from images directly in the browser environment with ease and no additional costs or privacy concerns. Explore an interactive demo.
ryu

12 1,152 5.9 C++ parquet-format VS ryu

Converts floating point numbers to decimal strings (by ulfjack)
FastSAM

4 6,839 8.6 Python parquet-format VS FastSAM

Fast Segment Anything
xgen

2 713 7.0 Python parquet-format VS xgen

Salesforce open-source LLMs with 8k sequence length.
wizmap

1 365 7.1 TypeScript parquet-format VS wizmap

Explore and interpret large embeddings in your browser with interactive visualization! 📍
CML_AMP_LLM_Chatbot_Augmented_with_Enterprise_Data

5 45 5.8 Python parquet-format VS CML_AMP_LLM_Chatbot_Augmented_with_Enterprise_Data
papyrus

2 67 7.3 Dart parquet-format VS papyrus

A simple paper backup tool for GnuPG or SSH keys (by ooguz)
evernote-ai-chatbot

4 12 5.5 Python parquet-format VS evernote-ai-chatbot
quack-reduce

2 121 4.8 Python parquet-format VS quack-reduce

A playground for running duckdb as a stateless query engine over a data lake
arrow-tools

1 122 8.6 Rust parquet-format VS arrow-tools

A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better parquet-format alternative or higher similarity.

Suggest an alternative to parquet-format

parquet-format reviews and mentions

Posts with mentions or reviews of parquet-format. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-16.

Summing columns in remote Parquet files using DuckDB
4 projects | news.ycombinator.com | 16 Nov 2023

Right, there's all sorts of metadata and often stats included in any parquet file: https://github.com/apache/parquet-format#file-format
The offsets of said metadata are well-defined (i.e. in the footer) so for S3 / blob storage so long as you can efficiently request a range of bytes you can pull the metadata without having to read all the data.
FLaNK Stack for 4th of July
15 projects | dev.to | 3 Jul 2023
I have question related to Parquet files and AWS Glue
1 project | /r/dataengineering | 18 Jun 2023

As i read here https://github.com/apache/parquet-format/blob/master/LogicalTypes.md , they are store in Integer formats and these integers represent the number of days (for Date) or number of milliseconds, microseconds or nanoseconds (for DateTime) since 1970-01-01. This works as expected with the parquet file that written by our ETL tool from internal database --> S3, all Data/DateTime columns are Integers, means that in Glue Job, i have to convert these Integers back to Date/Datetime value to do some transformation on them. But when parquet files are written by Spark, they are Date/DateTime (or TimeStamp to be more concise) format not Integers (i checked by read these files again in other Glue Job) and that make me confused.
Parquet: More than just “Turbo CSV”
7 projects | news.ycombinator.com | 3 Apr 2023

Date is confusing with a timezone (UTC or otherwise) and the doco makes no such suggestion.
The Parquet datatypes documentation is pretty clear that there is a flag isAdjustedToUTC to define if the timestamp should be interpreted as having Instant semantics or Local semantics.
https://github.com/apache/parquet-format/blob/master/Logical...
Still no option to include a TZ offset in the data (so the same datum can be interpreted with both Local and Instant semantics) but not bad really.
A note from our sponsor - SaaSHub
www.saashub.com | 28 Apr 2024

SaaSHub helps you find the best software and product alternatives Learn more →