duckdb_fdw
odbc2parquet
Our great sponsors
duckdb_fdw | odbc2parquet | |
---|---|---|
4 | 5 | |
235 | 204 | |
- | - | |
7.3 | 9.3 | |
2 months ago | 13 days ago | |
PLpgSQL | Rust | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
duckdb_fdw
- Querying Postgres Tables Directly from DuckDB
- Postgres and Parquet in the Data Lke
-
DuckDB quacks Arrow: A zero-copy data integration between Arrow and DuckDB
I should also add that there is a duckdb fdw, so you could have DuckDB read from your parquet files and do faster transformations before you pull your data into Postgres!
https://github.com/alitrack/duckdb_fdw
- DuckDB Postgres Foreign Data Wrapper
odbc2parquet
- Postgres and Parquet in the Data Lke
-
MySQL table data to direct parquet output
Although, I found a GitHub page (odbc2parquet) which can export the table (also a query output) to parquet.
-
Parquet best practices
Is this a one-time task? Maybe check out ODBC2PARQUET https://github.com/pacman82/odbc2parquet
-
Thoughts on Using Airbyte to read/write to S3?
I tried writing parquet to s3 with Airbyte a few months ago and gave up. It was extremely slow for small tables and would not work at all for larger tables. I wound up using this https://github.com/pacman82/odbc2parquet + aws cli
-
Extract data from ERP systems to Snowflake - Which tools (besides Airbyte)?
Yes, I have been tinkering around with odbc2parquet (https://github.com/pacman82/odbc2parquet) and storing it in a variant column. For the dependency/workflow management maybe prefect
What are some alternatives?
subzero-starter-kit - Starter Kit and tooling for authoring GraphQL/REST API backends with subZero
sql-spark-connector - Apache Spark Connector for SQL Server and Azure SQL
aquameta - Web development platform built entirely in PostgreSQL
roapi - Create full-fledged APIs for slowly moving datasets without writing a single line of code.
postgres_vectorization_test - Vectorized executor to speed up PostgreSQL
sqlpad - Web-based SQL editor. Legacy project in maintenance mode.
parquet_fdw - Parquet foreign data wrapper for PostgreSQL
geoparquet - Specification for storing geospatial vector data (point, line, polygon) in Parquet
parquet_s3_fdw - ParquetS3 Foreign Data Wrapper for PostgresSQL
FreeSql - 🦄 .NET aot orm, C# orm, VB.NET orm, Mysql orm, Postgresql orm, SqlServer orm, Oracle orm, Sqlite orm, Firebird orm, 达梦 orm, 人大金仓 orm, 神通 orm, 翰高 orm, 南大通用 orm, 虚谷 orm, 国产 orm, Clickhouse orm, QuestDB orm, MsAccess orm.
cstore_fdw - Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.