Python Snowflake

Open-source Python projects categorized as Snowflake

Top 23 Python Snowflake Projects

  1. airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Project mention: 7 Best Data Integration Platforms: Reviews & Top Picks | dev.to | 2025-05-26

    Website: https://airbyte.com/

  3. sqlglot

    Python SQL Parser and Transpiler

    Project mention: Show HN: SQL-tString a t-string SQL builder in Python | news.ycombinator.com | 2025-05-16

    https://github.com/tobymao/sqlglot:

    > SQLGlot is a no-dependency SQL parser, transpiler, optimizer, and engine [written in Python]. It can be used to format SQL or translate between 24 different dialects like DuckDB, Presto / Trino, Spark / Databricks, Snowflake, and BigQuery. It aims to read a wide variety of SQL inputs and output syntactically and semantically correct SQL in the targeted dialects.

  4. ibis

    The portable Python dataframe library

    Project mention: Why Pandas feels clunky when coming from R (2024) | news.ycombinator.com | 2025-06-07

    pandas* per the style guide (nobody follows it)

    also I recommend trying Ibis. created by the creator of pandas originally and solves so many of the issues

    https://ibis-project.org

  5. ingestr

    ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

    Project mention: Ingestr: Your New Best Friend for Effortless Data Migration | dev.to | 2025-06-28

  6. soda-core

    ⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

  7. jupysql

    Better SQL in Jupyter. 📊

  8. datacompy

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

  10. snowChat

    Chat snowflake - Text to SQL

  11. versatile-data-kit

    One framework to develop, deploy and operate data workflows with Python and SQL.

  12. grai-core

  13. snowpark-python

    Snowflake Snowpark Python API

  14. dbt-coves

    CLI tool for dbt users to simplify creation of staging models (yml and sql) files

  15. CueObserve

    Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases

  16. snowflake-cli

    Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations.

  17. dbt-ml-preprocessing

    A SQL port of Python's scikit-learn preprocessing module, provided as cross-database dbt macros.

  18. diepvries

    The Picnic Data Vault framework.

  19. SnowDDL

    Declarative-style object management tool for Snowflake.

  20. data-observability-installer

    Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

  21. prism

    Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)

  22. pgwarehouse

    Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.

  23. snowpark-python-template

    Python project template for Snowpark development

  24. dbd

    dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.

  25. snowflake-provisioning

    Snowflake Database, Schema, and Warehouse provisioning with Access Roles & Generating and Provisioning of Functional Roles & Snowflake Source Export, Snowflake cloning, and data tieout tool

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Snowflake related posts

  • All Data and AI Weekly #192 - June 2, 2025

    8 projects | dev.to | 2 Jun 2025
  • I built a data pipeline tool in Go

    3 projects | dev.to | 23 Dec 2024
  • Show HN: I built an open-source data pipeline tool in Go

    6 projects | news.ycombinator.com | 17 Dec 2024
  • Show HN: Snowflake warehouse implementation using DuckDB and Apache Iceberg

    1 project | news.ycombinator.com | 4 Sep 2024
  • Query Snowflake tables locally without any need for a running warehouse

    1 project | news.ycombinator.com | 28 Aug 2024
  • Show HN: SQLFrame – I ran PySpark without Spark on a SQL database

    3 projects | news.ycombinator.com | 20 May 2024
  • Vanna.ai: Chat with your SQL database

    13 projects | news.ycombinator.com | 14 Jan 2024

Index

What are some of the best open-source Snowflake projects in Python? This list will help you:

#   Project                       Stars
1   airbyte                       18,623
2   sqlglot                       7,947
3   ibis                          5,893
4   ingestr                       3,024
5   soda-core                     2,130
6   jupysql                       788
7   datacompy                     575
8   snowChat                      526
9   versatile-data-kit            450
10  grai-core                     306
11  snowpark-python               302
12  dbt-coves                     262
13  CueObserve                    229
14  snowflake-cli                 206
15  dbt-ml-preprocessing          184
16  diepvries                     126
17  SnowDDL                       123
18  data-observability-installer  120
19  prism                         85
20  pgwarehouse                   84
21  snowpark-python-template      75
22  dbd                           57
23  snowflake-provisioning        43
