trino

Top 17 trino Open-Source Projects

  • Trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

  • Project mention: Trino: Fast distributed SQL query engine for big data analytics | news.ycombinator.com | 2024-03-19
  • sqlglot

    Python SQL Parser and Transpiler

  • Project mention: The Future of MySQL is PostgreSQL: an extension for the MySQL wire protocol | news.ycombinator.com | 2024-04-26

    This is probably referring to "zero changes to your driver code" and not "zero changes to the SQL you send over this driver".

    Translating between SQL dialects is notoriously hard, and attempts to translate [1] work in about 95% of cases. But the last 5% would require 5x the amount of work. That's because a "SQL dialect" also includes weird edge cases, such as type inference for expressions like COALESCE(5, FALSE), and emulation of system catalogs (pg_catalog, information_schema).

    [1] https://github.com/tobymao/sqlglot

  • ibis

    the portable Python dataframe library

  • Project mention: Show HN: Hashquery, a Python library for defining reusable analysis | news.ycombinator.com | 2024-04-23

    I really don't understand the appeal of dbt vs a proper programming language. The templating approach leads to massive spaghetti. I look forward to trying out something like Ibis [0]

    0: https://ibis-project.org/

  • data-diff

    Compare tables within or across databases

  • Project mention: How to Check 2 SQL Tables Are the Same | news.ycombinator.com | 2023-07-26

    If the issue happens a lot, there is also: https://github.com/datafold/data-diff

    That is a nice tool to do it cross database as well.

    I think it's based on a checksum method.
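
The checksum idea mentioned in the comment above can be sketched roughly as follows. This is a minimal illustration of the general technique, not data-diff's actual implementation: it hashes every row of a table in key order and compares the two digests. The helper names (`table_checksum`, `tables_match`, `make_demo_db`) and the `users` table are hypothetical, and SQLite stands in for whatever databases are being compared.

```python
import hashlib
import sqlite3


def table_checksum(conn, table, key="id"):
    """Hash every row of `table`, read in `key` order, into one hex digest."""
    digest = hashlib.sha256()
    # Identifiers cannot be bound as query parameters; this sketch assumes
    # `table` and `key` come from trusted code, never from user input.
    for row in conn.execute(f'SELECT * FROM "{table}" ORDER BY "{key}"'):
        digest.update(repr(row).encode("utf-8"))
    return digest.hexdigest()


def tables_match(conn_a, table_a, conn_b, table_b, key="id"):
    """Return True when both tables produce the same checksum."""
    return table_checksum(conn_a, table_a, key) == table_checksum(conn_b, table_b, key)


def make_demo_db(rows):
    """Build an in-memory database with a hypothetical `users` table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany("INSERT INTO users VALUES (?, ?)", rows)
    return conn


if __name__ == "__main__":
    a = make_demo_db([(1, "ann"), (2, "bob")])
    b = make_demo_db([(2, "bob"), (1, "ann")])    # same rows, different insert order
    c = make_demo_db([(1, "ann"), (2, "bobby")])  # one value differs
    print(tables_match(a, "users", b, "users"))
    print(tables_match(a, "users", c, "users"))
```

A single whole-table digest only says whether the tables differ, not where. Hashing key ranges separately and recursing into the ranges that disagree would narrow a mismatch down to specific rows.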

  • logica

    Logica is a logic programming language that compiles to SQL. It runs on Google BigQuery, PostgreSQL and SQLite.

  • Project mention: Prolog language for PostgreSQL proof of concept | news.ycombinator.com | 2024-03-30

    If you're interested in this I would also recommend you check out Logica[0], which is a datalog-like language that is explicitly made to compile to SQL queries.

    0: https://logica.dev/

  • jupysql

    Better SQL in Jupyter. 📊

  • Project mention: Show HN: JupySQL – a SQL client for Jupyter (ipython-SQL successor) | news.ycombinator.com | 2023-12-06

    Hey, HN community!

    We're stoked to launch JupySQL today! JupySQL is an open-source library that brings a modern SQL experience to Jupyter. JupySQL is compatible with all major databases, such as Snowflake, Redshift, PostgreSQL, MySQL, MariaDB, DuckDB, SQL Server, ClickHouse, Trino, and more!

    To get started, check out our tutorial: https://jupysql.ploomber.io/en/latest/quick-start.html

    SQL is the de facto language for data analysis; however, analysis often requires a mix of SQL and Python. JupySQL bridges this gap, allowing users to execute SQL queries seamlessly in Jupyter and continue their analysis in Python. Add %%sql to the top of your cell and start writing SQL.

    Here are some of JupySQL's main features:

    - Syntax highlighting

  • versatile-data-kit

    One framework to develop, deploy and operate data workflows with Python and SQL.

  • Project mention: Looking for a data blogger | /r/opensource | 2023-05-19

    Here's the project: https://github.com/vmware/versatile-data-kit

  • ngods-stocks

    New Generation Opensource Data Stack Demo

  • quix

    Quix Notebook Manager

  • trino-getting-started

  • prism

    Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)

  • Project mention: Prism: the easiest way to create robust data workflows. Accessible via CLI | /r/coolgithubprojects | 2023-09-21
  • itachi

    A library that brings useful functions from various modern database management systems to Apache Spark

  • hive-metastore

    Apache Hive Metastore as a Standalone server in Docker

  • Project mention: FLaNK Stack Weekly for 20 June 2023 | dev.to | 2023-06-20
  • Local-Data-LakeHouse

    Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.

  • lasagna

    A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka

  • Project mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
  • trino-aerospike.docker

    Trino with the Aerospike connector Docker image

  • trino-pubsub-event-listener

    Trino Google Pub/Sub event listener

  • Project mention: Trino Google Pub/Sub event listener | /r/Trino | 2023-05-04
NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

trino related posts

  • Looking for a data blogger

    1 project | /r/opensource | 19 May 2023
  • Need advice on ETL tool

    1 project | /r/dataengineering | 29 Mar 2023
  • If dbt is the "T" part of an "ELT", what do you use for "EL"?

    1 project | /r/dataengineering | 2 Mar 2023
  • Best way to fix errors in my data?

    1 project | /r/dataengineering | 1 Feb 2023
  • What Orchestration Tool do you use for batch ETL/ELT?

    1 project | /r/dataengineering | 31 Jan 2023
  • Dear, pipeline builders! Which step in your role is the most time consuming?

    1 project | /r/dataengineering | 16 Jan 2023
  • Problem definition / vibe check for a repo

    1 project | /r/opensource | 22 Dec 2022

Index

What are some of the best open-source trino projects? This list will help you:

Rank  Project                      Stars
1     Trino                        9,576
2     sqlglot                      5,511
3     ibis                         4,208
4     data-diff                    2,842
5     logica                       1,680
6     jupysql                      605
7     versatile-data-kit           410
8     ngods-stocks                 373
9     quix                         267
10    trino-getting-started        226
11    prism                        79
12    itachi                       53
13    hive-metastore               53
14    Local-Data-LakeHouse         44
15    lasagna                      28
16    trino-aerospike.docker       4
17    trino-pubsub-event-listener  2
