Python BigQuery

Open-source Python projects categorized as BigQuery

Top 23 Python BigQuery Projects

  1. Redash

    Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

    Project mention: The 50 best open-source alternatives to popular SaaS software | dev.to | 2024-07-10

  3. airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Project mention: Stream Processing Systems in 2025: RisingWave, Flink, Spark Streaming, and What's Ahead | dev.to | 2025-01-27

    Whenever we discuss event streaming, Kafka inevitably enters the conversation. As the de facto standard for event streaming, Kafka is widely used as a data pipeline to move data between systems. However, Kafka is not the only tool capable of facilitating data movement. Products like Fivetran, Airbyte, and other SaaS offerings provide user-friendly tools for data ingestion, expanding the options available to engineers.

  4. sqlglot

    Python SQL Parser and Transpiler

    Project mention: Duckberg! | dev.to | 2025-03-12

    sqlglot, an advanced SQL parsing library, could be a nice option to add here.

  5. ibis

    the portable Python dataframe library

    Project mention: Polars Cloud: The Distributed Cloud Architecture to Run Polars Anywhere | news.ycombinator.com | 2025-03-07

    Ibis also solves this problem by providing a portable dataframe API that works across multiple backends (DuckDB by default): https://ibis-project.org/

  6. ethereum-etl

    Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ

  7. ingestr

    ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

    Project mention: I built a data pipeline tool in Go | dev.to | 2024-12-23

    📥 ingest data with ingestr / Python

  8. professional-services

    Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.

  10. jupysql

    Better SQL in Jupyter. 📊

  11. python-bigquery-pandas

    Google BigQuery connector for pandas

  12. BigQuery-Python

    Simple Python client for interacting with Google BigQuery.

  13. pypinfo

    Easily view PyPI download statistics via Google's BigQuery.

  14. astro-sdk

    Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.

  15. dbt-coves

    CLI tool for dbt users to simplify creation of staging models (yml and sql) files

  16. bigquery-schema-generator

    Generates the BigQuery schema from newline-delimited JSON or CSV data records.

  17. python-bigquery-dataframes

    BigQuery DataFrames

  18. CueObserve

    Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases

  19. dbt-ml-preprocessing

    A SQL port of Python's scikit-learn preprocessing module, provided as cross-database dbt macros.

  20. dataproc-templates

    Dataproc templates and pipelines for solving simple in-cloud data tasks

  21. bigquery_fdw

    BigQuery Foreign Data Wrapper for PostgreSQL

  22. prism

    Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)

  23. iris3

    An upgraded and improved version of the Iris automatic GCP-labeling project

  24. dbd

    dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python BigQuery related posts

  • I built a data pipeline tool in Go

    3 projects | dev.to | 23 Dec 2024
  • Show HN: I built an open-source data pipeline tool in Go

    6 projects | news.ycombinator.com | 17 Dec 2024
  • This Week In Python

    5 projects | dev.to | 17 Mar 2024
  • Show HN: I built an open-source data copy tool called ingestr

    3 projects | news.ycombinator.com | 27 Feb 2024
  • Ingestr: CLI tool to copy data between any databases with a single command

    1 project | news.ycombinator.com | 27 Feb 2024
  • JupySQL: Connecting to a SQL database from Jupyter

    1 project | /r/SQL | 9 Sep 2023
  • GitHub - ploomber/jupysql: Better SQL in Jupyter. 📊

    1 project | /r/coolgithubprojects | 6 Sep 2023

Index

What are some of the best open-source BigQuery projects in Python? This list will help you:

# Project Stars
1 Redash 27,065
2 airbyte 17,524
3 sqlglot 7,314
4 ibis 5,596
5 ethereum-etl 2,994
6 ingestr 2,908
7 professional-services 2,874
8 swirl-search 2,698
9 jupysql 752
10 python-bigquery-pandas 462
11 BigQuery-Python 456
12 pypinfo 430
13 astro-sdk 367
14 dbt-coves 259
15 bigquery-schema-generator 241
16 python-bigquery-dataframes 235
17 CueObserve 228
18 dbt-ml-preprocessing 182
19 dataproc-templates 124
20 bigquery_fdw 91
21 prism 84
22 iris3 71
23 dbd 57

Did you know that Python is the 2nd most popular programming language based on number of references?