parquet2
odbc2parquet
Our great sponsors
parquet2 | odbc2parquet | |
---|---|---|
6 | 5 | |
347 | 204 | |
- | - | |
3.2 | 9.3 | |
7 months ago | 3 days ago | |
Rust | Rust | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
parquet2
-
Rust is showing a lot of promise in the DataFrame / tabular data space
[arrow2](https://github.com/jorgecarleitao/arrow2) and [parquet2](https://github.com/jorgecarleitao/parquet2) are great foundational libraries for and DataFrame libs in Rust.
-
::lending-iterator — Lending/streaming Iterators on Stable Rust (and a pinch of HKT)
This is so freaking life-saving! - we have been using StreamingIterator and FallibleStreamingIterator in libraries (arrow2 and parquet2) and the existing landscape is quite confusing for new users!
- Anda para aqui alguém a brincar com Rust (linguagem)?
-
Parquet2 0.9 released (and a request for feedback)
Thanks a lot for your feedback. Based on it I am proposing the following change: https://github.com/jorgecarleitao/parquet2/pull/78
-
parquet2 0.3.0, with native support to read async
release on github.
odbc2parquet
- Postgres and Parquet in the Data Lke
-
MySQL table data to direct parquet output
Although, I found a GitHub page (odbc2parquet) which can export the table (also a query output) to parquet.
-
Parquet best practices
Is this a one-time task? Maybe check out ODBC2PARQUET https://github.com/pacman82/odbc2parquet
-
Thoughts on Using Airbyte to read/write to S3?
I tried writing parquet to s3 with Airbyte a few months ago and gave up. It was extremely slow for small tables and would not work at all for larger tables. I wound up using this https://github.com/pacman82/odbc2parquet + aws cli
-
Extract data from ERP systems to Snowflake - Which tools (besides Airbyte)?
Yes, I have been tinkering around with odbc2parquet (https://github.com/pacman82/odbc2parquet) and storing it in a variant column. For the dependency/workflow management maybe prefect
What are some alternatives?
parquet-format-rs - Apache Parquet format for Rust, hosting the Thrift definition file and the generated .rs file
sql-spark-connector - Apache Spark Connector for SQL Server and Azure SQL
rust-brotli - Brotli compressor and decompressor written in rust that optionally avoids the stdlib
roapi - Create full-fledged APIs for slowly moving datasets without writing a single line of code.
sqlpad - Web-based SQL editor. Legacy project in maintenance mode.
arrow2 - Transmute-free Rust library to work with the Arrow format
geoparquet - Specification for storing geospatial vector data (point, line, polygon) in Parquet
inkwell - It's a New Kind of Wrapper for Exposing LLVM (Safely)
duckdb_fdw - DuckDB Foreign Data Wrapper for PostgreSQL
pqrs - Command line tool for inspecting Parquet files
cstore_fdw - Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.