Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more โ
Top 9 Python trino Projects
-
This could be a nice option to add sqlglot here. As an advanced sql parsing library.
-
Judoscale
Save 47% on cloud hosting with autoscaling that just works. Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
-
I felt the same - have to relearn/lookup everything every time I went back to a project or wanted to do some operations that are simple to describe in SQL but I couldn't wrap my mind around e.g. using multi-indexed dataframes & aggregations properly. These days, I always jump to Polars instead of Pandas - much more intuitive and consistent API. Tons of props to Pandas for all that they did (and continue to do) in the data space, but their API did not evolve very well IMO.
I've also been wanting to play with Ibis[1] recently, but Polars has been sufficient for me.
[1] https://ibis-project.org/
-
-
-
-
prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
-
Project mention: AthenaSQL: SQL query builder for AWS Athena, inspired by pySpark SQL | news.ycombinator.com | 2024-11-04
Hi Everyone,
I work in adtech, where we handle massive log-level data. To cut costs and improve performance for ML and optimization, my team and I chose a lakehouse approach using AWS (S3 + OTFs / partitioned Parquet + Athena + Glue).
One challenge we faced with this data stack was managing Athena queries in our ETL jobs. Since Athena handles much of our data-heavy processing, we ended up storing hundreds of lines of query code as strings in Python scripts, which quickly became a nightmare to maintain.
We needed something similar to PySpark SQL that could output SQL string compatible with Athena. So we built athenaSQL. It mimics the PySpark SQL API, providing a familiar interface and outputting SQL queries directly.
It is far from complete at the moment but it has most of the basic query statements. I would love it if you could test it out and share any feedback! I hope someone is in need of such a tool, if it lacks the functionality you are seeking, letโs build it together! And feel free to critique it as much as you like. :)
github: https://github.com/nabilseid/athenaSQL
docs: github.com/nabilseid/athenaSQL
Python trino discussion
Python trino related posts
-
FireDucks: Pandas but 100x Faster
-
Looking for a data blogger
-
Need advice on ETL tool
-
If dbt is the "T" part of an "ELT", what do you use for "EL"?
-
Best way to fix errors in my data?
-
What Orchestration Tool do you use for batch ETL/ELT?
-
Dear, pipeline builders! Which step in your role is the most time consuming?
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 23 Apr 2025
Index
What are some of the best open-source trino projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | sqlglot | 7,531 |
2 | ibis | 5,699 |
3 | jupysql | 769 |
4 | versatile-data-kit | 446 |
5 | trino-getting-started | 262 |
6 | dbt-trino | 232 |
7 | prism | 85 |
8 | hive-metastore | 74 |
9 | athenaSQL | 6 |