Our great sponsors
-
Apache Arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I'm not tuned into Arrow all that much. I've some of the about and stuff, but the code examples (to my eye) look really complex and complicated. [1]
Could someone point me to a more glossy "arrow flight sql for dummies" examples? What I'm gleaning from this (or am I wrong?) is you could use a JDBC driver + arrow jdbc client and write... SQL? Or is it something a lot different?
Is this the sort of thing where you could just add a plugin to postgres and be arrowified or something?
[1] https://github.com/apache/arrow/blob/release-7.0.0/java/flig...
I have been experimenting in my free time with building a platform that autogenerates GraphQL CRUD API's on top of arbitrary datasources and lets you do federated/distributed queries and cross-datasource joins.
I am using Apache Calcite for this, but am interested in potentially using FlightSQL and Substrait for better performance, since I am targeting OLTP workloads and it's latency-sensitive.
https://github.com/GavinRay97/GraphQLCalcite
Oh, and for anyone interested in pitching in on the Rust implementation, there's an issue logged here along with some discussion: https://github.com/apache/arrow-rs/issues/1323
Related posts
- Full-fledged APIs for slowly moving datasets without writing code
- "thread 'main' panicked at 'no CA certificates found'", when running application in docker container
- Roapi 0.9 release adds support for all cloud storage providers
- Run SQL on CSV, Parquet, JSON, Arrow, Unix Pipes and Google Sheet
- Run SQL on CSV, Parquet, JSON, Arrow, Unix Pipes and Google Sheet