Python trino

Open-source Python projects categorized as trino

Top 9 Python trino Projects

  1. sqlglot

    Python SQL Parser and Transpiler

    Project mention: Duckberg! | dev.to | 2025-03-12

    sqlglot could be a nice addition here, as an advanced SQL parsing library.

  2. ibis

    the portable Python dataframe library

    Project mention: Coding as Craft: Going Back to the Old Gym | news.ycombinator.com | 2025-04-22

    I felt the same - have to relearn/lookup everything every time I went back to a project or wanted to do some operations that are simple to describe in SQL but I couldn't wrap my mind around e.g. using multi-indexed dataframes & aggregations properly. These days, I always jump to Polars instead of Pandas - much more intuitive and consistent API. Tons of props to Pandas for all that they did (and continue to do) in the data space, but their API did not evolve very well IMO.

    I've also been wanting to play with Ibis[1] recently, but Polars has been sufficient for me.

    [1] https://ibis-project.org/

  3. jupysql

    Better SQL in Jupyter. 📊

  4. versatile-data-kit

    One framework to develop, deploy and operate data workflows with Python and SQL.

  5. trino-getting-started

  6. dbt-trino

    The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)

  7. prism

    Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)

  8. hive-metastore

    Apache Hive Metastore as a Standalone server in Docker

  9. athenaSQL

    SQL builder for AWS Athena, inspired by sparkSQL

    Project mention: AthenaSQL: SQL query builder for AWS Athena, inspired by pySpark SQL | news.ycombinator.com | 2024-11-04

    Hi Everyone,

    I work in adtech, where we handle massive log-level data. To cut costs and improve performance for ML and optimization, my team and I chose a lakehouse approach using AWS (S3 + OTFs / partitioned Parquet + Athena + Glue).

    One challenge we faced with this data stack was managing Athena queries in our ETL jobs. Since Athena handles much of our data-heavy processing, we ended up storing hundreds of lines of query code as strings in Python scripts, which quickly became a nightmare to maintain.

    We needed something similar to PySpark SQL that could output SQL string compatible with Athena. So we built athenaSQL. It mimics the PySpark SQL API, providing a familiar interface and outputting SQL queries directly.

    It is far from complete at the moment but it has most of the basic query statements. I would love it if you could test it out and share any feedback! I hope someone is in need of such a tool, if it lacks the functionality you are seeking, let’s build it together! And feel free to critique it as much as you like. :)

    github: https://github.com/nabilseid/athenaSQL

    docs: github.com/nabilseid/athenaSQL
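
The approach described in the post above, composing queries through a Python API instead of storing long SQL strings in scripts, can be illustrated with a minimal sketch. This is not the athenaSQL API: the `Query` class, its methods, and the `ad_logs` table and column names are all hypothetical and exist only to show the builder pattern.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Query:
    """Hypothetical fluent SQL builder (illustration only, not athenaSQL)."""

    table: str
    columns: List[str] = field(default_factory=lambda: ["*"])
    conditions: List[str] = field(default_factory=list)
    row_limit: Optional[int] = None

    def select(self, *columns: str) -> "Query":
        # Fall back to "*" when no columns are given.
        self.columns = list(columns) or ["*"]
        return self

    def where(self, condition: str) -> "Query":
        # Conditions are raw SQL fragments; they are combined with AND.
        self.conditions.append(condition)
        return self

    def limit(self, n: int) -> "Query":
        self.row_limit = n
        return self

    def to_sql(self) -> str:
        # Render the collected clauses as one SQL string.
        parts = [f"SELECT {', '.join(self.columns)}", f"FROM {self.table}"]
        if self.conditions:
            parts.append("WHERE " + " AND ".join(f"({c})" for c in self.conditions))
        if self.row_limit is not None:
            parts.append(f"LIMIT {self.row_limit}")
        return "\n".join(parts)


# Assumed table and column names, chosen only for the example.
sql = (
    Query("ad_logs")
    .select("campaign_id", "cost")
    .where("event_date = DATE '2024-11-01'")
    .where("cost > 0")
    .limit(10)
    .to_sql()
)
print(sql)
```

Because the sketch accepts raw SQL fragments and does no escaping or validation, it only shows the shape of the idea; a real builder would need proper expression objects and quoting.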

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python trino related posts

  • FireDucks: Pandas but 100x Faster

    4 projects | news.ycombinator.com | 20 Nov 2024
  • Looking for a data blogger

    1 project | /r/opensource | 19 May 2023
  • Need advice on ETL tool

    1 project | /r/dataengineering | 29 Mar 2023
  • If dbt is the "T" part of an "ELT", what do you use for "EL"?

    1 project | /r/dataengineering | 2 Mar 2023
  • Best way to fix errors in my data?

    1 project | /r/dataengineering | 1 Feb 2023
  • What Orchestration Tool do you use for batch ETL/ELT?

    1 project | /r/dataengineering | 31 Jan 2023
  • Dear, pipeline builders! Which step in your role is the most time consuming?

    1 project | /r/dataengineering | 16 Jan 2023

Index

What are some of the best open-source trino projects in Python? This list will help you:

#  Project                Stars
1  sqlglot                7,531
2  ibis                   5,699
3  jupysql                  769
4  versatile-data-kit       446
5  trino-getting-started    262
6  dbt-trino                232
7  prism                     85
8  hive-metastore            74
9  athenaSQL                  6
