Launch HN: Hydra (YC W22) – Query Any Database via Postgres

Multicorn

8 694 0.0 Python

Data Access Library

This is really nice! Congrats!
I once started building something similar as a side project, but focused on querying cloud resources (S3 buckets, EC2 instances, etc.); finding the biggest file in a bucket was trivial with this. I abandoned the project, but someone else built a startup on the same concept, and even the name was the same: cloudquery.
I built it using the Multicorn [1] Postgres extension, and it is delightful how easy it is to get something simple running.
[1] https://multicorn.org/
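
To give a feel for what the comment describes, here is a minimal sketch of a Multicorn foreign data wrapper in Python. It is not the commenter's code. `ForeignDataWrapper` and its `execute(quals, columns)` method come from Multicorn; the class name `BucketObjectsFdw`, the `bucket` option, and the in-memory `FAKE_BUCKET_LISTING` (standing in for a real cloud API call) are assumptions made for illustration. The fallback base class exists only so the sketch runs outside a Postgres backend.

```python
# Multicorn is only importable inside a PostgreSQL backend that has the
# extension installed; this fallback lets the sketch run standalone.
try:
    from multicorn import ForeignDataWrapper
except ImportError:
    class ForeignDataWrapper:
        def __init__(self, options, columns):
            pass


# Hypothetical stand-in for a cloud API listing call (objects in a bucket).
FAKE_BUCKET_LISTING = [
    {"key": "logs/2022-02-01.gz", "size": 1204},
    {"key": "backups/db.dump", "size": 98304},
    {"key": "images/logo.png", "size": 5120},
]


class BucketObjectsFdw(ForeignDataWrapper):
    """Expose a bucket listing as rows of a foreign table."""

    def __init__(self, options, columns):
        super().__init__(options, columns)
        self.columns = columns
        # 'bucket' is a hypothetical option set on the foreign table.
        self.bucket = options.get("bucket", "example-bucket")

    def execute(self, quals, columns):
        # This sketch ignores quals and yields every row as a dict
        # keyed by column name.
        for obj in FAKE_BUCKET_LISTING:
            yield {"key": obj["key"], "size": obj["size"]}


# Standalone usage: the same question the comment mentions,
# "which file in the bucket is the biggest?"
fdw = BucketObjectsFdw({"bucket": "demo"}, {"key": None, "size": None})
rows = list(fdw.execute([], ["key", "size"]))
biggest = max(rows, key=lambda row: row["size"])
```

Once such a wrapper is registered in Postgres, the same question becomes an ordinary `ORDER BY size DESC LIMIT 1` query against the foreign table.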

Apache Calcite

28 4,376 9.0 Java

Apache Calcite

For anyone interested, Apache Calcite [0] is an open source data management framework which seems to do many of the same things that Hydra claims to do, but takes a different approach. Operating as a Java library, Calcite contains "adapters" to many different data sources, from existing JDBC connectors to Elasticsearch to Cassandra. All of these different data sources can be joined together as desired. Calcite also has its own optimizer, which is able to push down relevant parts of the query to the different data sources. You still get full SQL on data sources which don't support it, though, with Calcite executing the remaining bits itself.
Unfortunately, I would not be too surprised if Calcite was found to be less performance-optimized than Hydra. That said, there are users of Calcite at Google, Uber, Spotify, and others who have made great use of various parts of the framework.
[0] https://calcite.apache.org/
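
The pushdown idea in this comment can be illustrated with a toy sketch. This is not Calcite's API (Calcite is a Java library with a rule-based optimizer); it is a simplified Python model in which a hypothetical `Source` declares which comparison operators it can evaluate itself, and a hypothetical `query` function pushes those filters down and applies the rest locally.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Row = Dict[str, object]
Filter = Tuple[str, str, object]  # (column, operator, value)

OPS: Dict[str, Callable[[object, object], bool]] = {
    "=": lambda a, b: a == b,
    ">": lambda a, b: a > b,
    "<": lambda a, b: a < b,
}


def matches(row: Row, f: Filter) -> bool:
    column, op, value = f
    return OPS[op](row[column], value)


@dataclass
class Source:
    rows: List[Row]
    supported_ops: frozenset  # operators the source can evaluate itself

    def scan(self, pushed: List[Filter]) -> List[Row]:
        # Simulates the source applying the pushed-down filters natively.
        return [r for r in self.rows if all(matches(r, f) for f in pushed)]


def query(source: Source, filters: List[Filter]) -> Tuple[List[Row], int]:
    """Return (result rows, number of rows fetched from the source)."""
    pushed = [f for f in filters if f[1] in source.supported_ops]
    residual = [f for f in filters if f[1] not in source.supported_ops]
    fetched = source.scan(pushed)
    # The engine evaluates whatever the source could not.
    result = [r for r in fetched if all(matches(r, f) for f in residual)]
    return result, len(fetched)


data = [
    {"id": 1, "age": 25},
    {"id": 2, "age": 40},
    {"id": 3, "age": 40},
    {"id": 4, "age": 31},
]
filters = [("age", "=", 40), ("id", ">", 2)]

# A source that only understands equality: one filter is pushed down.
partial_result, partial_fetched = query(Source(data, frozenset({"="})), filters)
# A source with no filter support: the engine does all the work.
dumb_result, dumb_fetched = query(Source(data, frozenset()), filters)
```

Both calls return the same rows; the difference is how many rows have to leave the source, which is why pushing work down matters for performance.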

steampipe

146 6,401 9.7 Go

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
metriql

7 284 1.9 Kotlin

The metrics layer for your data. Join us at https://metriql.com/slack

Presto is pretty successful, but its focus is to be a distributed query engine, not a proxy layer for existing query engines. We use Trino (formerly Presto) as our query layer and do something similar to Hydra at Metriql [1], with a fairly different use case: data people provide a semantic layer with the metrics and expose them to 18+ downstream tools.
[1]: https://metriql.com

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

Does Java have an open source package that can execute SQL on txt/csv?
3 projects | /r/programming | 22 Sep 2021

What is Cost-based Optimization?
4 projects | dev.to | 2 Jun 2021

ELT as Compliance Enabler: Running Steampipe with Matillion Data Productivity Cloud
1 project | dev.to | 25 Apr 2024

Steampipe: Dynamically query APIs, code and more with SQL
1 project | news.ycombinator.com | 4 Apr 2024

Show HN: Query Your Sheets with SheetSQL
9 projects | news.ycombinator.com | 13 Mar 2024

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
SQL Big Data Projects steampipe dbt
Post date: 23 Feb 2022
