Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →
Top 23 Python Snowflake Projects
-
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Website: https://airbyte.com/
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
Project mention: Show HN: SQL-tString a t-string SQL builder in Python | news.ycombinator.com | 2025-05-16
https://github.com/tobymao/sqlglot :
> SQLGlot is a no-dependency SQL parser, transpiler, optimizer, and engine [written in Python] . It can be used to format SQL or translate between 24 different dialects like DuckDB, Presto / Trino, Spark / Databricks, Snowflake, and BigQuery. It aims to read a wide variety of SQL inputs and output syntactically and semantically correct SQL in the targeted dialects.
-
Project mention: Why Pandas feels clunky when coming from R (2024) | news.ycombinator.com | 2025-06-07
pandas* per the style guide (nobody follows it)
also I recommend trying Ibis. created by the creator of pandas originally and solves so many of the issues
https://ibis-project.org
-
View the Project on GitHub
-
soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
-
-
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
-
-
-
-
CueObserve
Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases
-
snowflake-cli
Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations.
-
dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
-
-
-
data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
-
prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)
-
-
-
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
-
snowflake-provisioning
Snowflake Database, Schema, and Warehouse provisioning with Access Roles & Generating and Provisioning of Functional Roles & Snowflake Source Export, Snowflake cloning, and data tieout tool
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Snowflake discussion
Python Snowflake related posts
-
All Data and AI Weekly #192 - June 2, 2025
-
I built a data pipeline tool in Go
-
Show HN: I built an open-source data pipeline tool in Go
-
Show HN: Snowflake warehouse implementation using DuckDB and Apache Iceberg
-
Query Snowflake tables locally without any need for a running warehouse
-
Show HN: SQLFrame – I ran PySpark without Spark on a SQL database
-
Vanna.ai: Chat with your SQL database
-
A note from our sponsor - Stream
getstream.io | 8 Jul 2025
Index
What are some of the best open-source Snowflake projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | airbyte | 18,623 |
2 | sqlglot | 7,947 |
3 | ibis | 5,893 |
4 | ingestr | 3,024 |
5 | soda-core | 2,130 |
6 | jupysql | 788 |
7 | datacompy | 575 |
8 | snowChat | 526 |
9 | versatile-data-kit | 450 |
10 | grai-core | 306 |
11 | snowpark-python | 302 |
12 | dbt-coves | 262 |
13 | CueObserve | 229 |
14 | snowflake-cli | 206 |
15 | dbt-ml-preprocessing | 184 |
16 | diepvries | 126 |
17 | SnowDDL | 123 |
18 | data-observability-installer | 120 |
19 | prism | 85 |
20 | pgwarehouse | 84 |
21 | snowpark-python-template | 75 |
22 | dbd | 57 |
23 | snowflake-provisioning | 43 |