Launch HN: Hydra (YC W22) – Query Any Database via Postgres

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • Multicorn

    Data Access Library

  • This is really nice! Congrats!

    I once started building as a side project something similar but focused on querying cloud resources (like S3 buckets, ec2s, etc... discovering the biggest file from a bucket was trivial with this). I abandoned the project but someone else built a startup on the same concept - even the name was the same: cloudquery.

    I built it using the multicorn [1] postgres extension and it is deligthful of how easy it to get something simple running.

    [1] https://multicorn.org/

  • Apache Calcite

    Apache Calcite

  • For anyone interested, Apache Calcite[0] is an open source data management framework which seems to do many of the same things that Hydra claims to do, but taking a different approach. Operating as a Java library, Calcite contains "adapters" to many different data sources from existing JDBC connectors to Elasticsearch to Cassandra. All of these different data sources can be joined together as desired. Calcite also has it's own optimizer which is able to push down relevant parts of the query to the different data sources. However, you get full SQL on data sources which don't support it, with Calcite executing the remaining bits itself.

    Unfortunately, I would not be too surprised if Calcite was found to be less performance-optimized than Hydra. That said, there are users of Calcite at Google, Uber, Spotify, and others who have made great use of various parts of the framework.

    [0] https://calcite.apache.org/

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • steampipe

    Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.

  • metriql

    The metrics layer for your data. Join us at https://metriql.com/slack

  • Presto is pretty successful but its focus is to be distributed query engine, not a proxy layer for the existing query engines. We use Trino ( formerly Presto) as our query layer and do something similar to Hydra at Metriql [1] with a fairly different use-case. Data people provide a semantic layer with the mecrics and expose them to 18+ downstream tools.

    [1]: https://metriql.com

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Does Java have an open source package that can execute SQL on txt/csv?

    3 projects | /r/programming | 22 Sep 2021
  • What is Cost-based Optimization?

    4 projects | dev.to | 2 Jun 2021
  • ELT as Compliance Enabler: Running Steampipe with Matillion Data Productivity Cloud

    1 project | dev.to | 25 Apr 2024
  • Steampipe: Dynamically query APIs, code and more with SQL

    1 project | news.ycombinator.com | 4 Apr 2024
  • Show HN: Query Your Sheets with SheetSQL

    9 projects | news.ycombinator.com | 13 Mar 2024