Top 23 Python BigQuery Projects
-
Redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Project mention: The 50 best open-source alternatives to popular SaaS software | dev.to | 2024-07-10
GitHub: Redash GitHub Repository
-
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Project mention: Stream Processing Systems in 2025: RisingWave, Flink, Spark Streaming, and What's Ahead | dev.to | 2025-01-27
Whenever we discuss event streaming, Kafka inevitably enters the conversation. As the de facto standard for event streaming, Kafka is widely used as a data pipeline to move data between systems. However, Kafka is not the only tool capable of facilitating data movement. Products like Fivetran, Airbyte, and other SaaS offerings provide user-friendly tools for data ingestion, expanding the options available to engineers.
-
sqlglot, an advanced SQL parsing library, could be a nice option to add here.
-
Project mention: Polars Cloud: The Distributed Cloud Architecture to Run Polars Anywhere | news.ycombinator.com | 2025-03-07
Ibis also solves this problem by providing a portable dataframe API that works across multiple backends (DuckDB by default): https://ibis-project.org/
-
ethereum-etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
-
📥 ingest data with ingestr / Python
-
professional-services
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.
-
swirl-search
Swirl is an open-source search platform that uses AI to search multiple content and data sources simultaneously and return AI-ranked results. It also provides LLM-generated summaries of the answers from your searches. It's a one-click, easy-to-use Retrieval Augmented Generation (RAG) solution.
Project mention: How These Free Open Source Projects Can Jumpstart Your Career (No Experience? No Problem!) | dev.to | 2024-12-13
Give SWIRL a try: https://github.com/swirlai/swirl-search
-
astro-sdk
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
-
bigquery-schema-generator
Generates the BigQuery schema from newline-delimited JSON or CSV data records.
-
CueObserve
Time series anomaly detection and root cause analysis on data in SQL data warehouses and databases.
-
dbt-ml-preprocessing
A SQL port of Python's scikit-learn preprocessing module, provided as cross-database dbt macros.
-
prism
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. (by runprism)
-
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Python BigQuery related posts
-
I built a data pipeline tool in Go
-
Show HN: I built an open-source data pipeline tool in Go
-
This Week In Python
-
Show HN: I built an open-source data copy tool called ingestr
-
Ingestr: CLI tool to copy data between any databases with a single command
-
JupySQL: Connecting to a SQL database from Jupyter
-
GitHub - ploomber/jupysql: Better SQL in Jupyter. 📊
Index
What are some of the best open-source BigQuery projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | Redash | 27,065 |
2 | airbyte | 17,524 |
3 | sqlglot | 7,314 |
4 | ibis | 5,596 |
5 | ethereum-etl | 2,994 |
6 | ingestr | 2,908 |
7 | professional-services | 2,874 |
8 | swirl-search | 2,698 |
9 | jupysql | 752 |
10 | python-bigquery-pandas | 462 |
11 | BigQuery-Python | 456 |
12 | pypinfo | 430 |
13 | astro-sdk | 367 |
14 | dbt-coves | 259 |
15 | bigquery-schema-generator | 241 |
16 | python-bigquery-dataframes | 235 |
17 | CueObserve | 228 |
18 | dbt-ml-preprocessing | 182 |
19 | dataproc-templates | 124 |
20 | bigquery_fdw | 91 |
21 | prism | 84 |
22 | iris3 | 71 |
23 | dbd | 57 |