Top 23 Jupyter Notebook SQL Projects
-
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Project mention: Data-engineer-handbook: everything to learn about data engineering | news.ycombinator.com | 2024-12-03
This thing points to some sort of github metrics dashboard.
The actual handbook is at: https://github.com/DataExpert-io/data-engineer-handbook
-
The-Complete-FAANG-Preparation
Dive into this repository, a comprehensive resource covering Data Structures, Algorithms, 450 DSA by Love Babbar, Striver DSA sheet, Apna College DSA Sheet, and FAANG Questions! 🚀 That's not all! We've got Technical Subjects like Operating Systems, DBMS, SQL, Computer Networks, and Object-Oriented Programming, all waiting for you.
-
logica
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
Project mention: Logica – declarative logic programming language for data | news.ycombinator.com | 2024-11-16
-
Project mention: SQL Server – Query Performance – Database Maintenance can Help | dev.to | 2025-01-14
-
bigquery-utils
Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.
Alternatives to: DuckDB, Apache Cassandra, Amazon RedShift, Google BigQuery, Snowflake, InfluxDB, Prometheus, Amazon Timestream
-
I'm developing https://github.com/buremba/universql for this use-case, though DuckDB is used under the hood.
-
sqlab
SQL Adventure Builder: transform a dataset and a collection of SQL exercises into a self-contained database
Project mention: Show HN: SQL Noir – Learn SQL by solving crimes | news.ycombinator.com | 2025-02-14
I myself am working on SQLab, a SQL game engine that allows you to augment an arbitrary base with exercises on that base to produce directed, standalone adventures: https://github.com/laowantong/sqlab.
-
SQL-LLM-Distillation-GRPO
Inspired by mathematical reasoning models like DeepSeekMath, this framework applies CoT to SQL generation and fine-tunes distilled models using GRPO to enhance both accuracy and interpretability.
Project mention: Structured Reasoning Distillation in SQL via CoT and Reinforcement Learning with GRPO | dev.to | 2025-05-22
Repo: SQL-LLM-Distillation-GRPO
Dataset: sql-distill-llama-3-1-70b-instruct-reasoning
Model Fine-Tuned: sql-llama3.2-3b-it-reasoning
-
rfm_sales_analysis
This project offers a data-driven approach to analyze sales performance using RFM (Recency, Frequency, Monetary Value) analysis, a powerful technique for gaining insights into customer behavior and optimizing marketing strategies.
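For readers unfamiliar with the technique, here is a minimal sketch of how the three RFM values can be computed per customer. It is not taken from the rfm_sales_analysis repository; the order records, the `rfm` helper, and the reference date are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from datetime import date

# Hypothetical order records: (customer_id, order_date, amount).
orders = [
    ("alice", date(2025, 1, 5), 120.0),
    ("alice", date(2025, 3, 1), 80.0),
    ("bob", date(2024, 11, 20), 40.0),
]


def rfm(orders, today):
    """Return {customer_id: (recency_days, frequency, monetary)}."""
    last_order = {}
    frequency = defaultdict(int)
    monetary = defaultdict(float)
    for customer, order_date, amount in orders:
        # Recency is measured from the most recent order.
        last_order[customer] = max(last_order.get(customer, order_date), order_date)
        frequency[customer] += 1    # number of orders
        monetary[customer] += amount  # total spend
    return {
        customer: ((today - last_order[customer]).days,
                   frequency[customer],
                   monetary[customer])
        for customer in last_order
    }


table = rfm(orders, today=date(2025, 3, 31))
for customer, (recency, freq, money) in sorted(table.items()):
    print(customer, recency, freq, money)
```

In practice the raw values are usually binned into scores (for example quintiles) before customers are segmented; that step is omitted here.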
Jupyter Notebook SQL related posts
-
SQL Server – Query Performance – Database Maintenance can Help
-
Logica – declarative logic programming language for data
-
Game Programming in Prolog
-
Logica
-
Performance Tuning Production
-
New welcome page for Logica language
-
SQL Server, SSRS, "login failed for anonymous" and Kerberos Config Mgr
-
Index
What are some of the best open-source SQL projects in Jupyter Notebook? This list will help you:
# | Project | Stars |
---|---|---|
1 | data-engineer-handbook | 34,992 |
2 | The-Complete-FAANG-Preparation | 11,288 |
3 | beakerx | 2,817 |
4 | logica | 1,983 |
5 | tigertoolbox | 1,542 |
6 | bigquery-utils | 1,219 |
7 | spyql | 930 |
8 | SQL-for-Data-Analytics | 287 |
9 | RasgoQL | 270 |
10 | snowflake-demo-notebooks | 252 |
11 | lang2sql | 246 |
12 | openbrewerydb | 193 |
13 | universql | 185 |
14 | data-science-notes | 68 |
15 | spaces-notebooks | 24 |
16 | vulcan-sql-examples | 22 |
17 | Data-Engineering-Portfolio | 14 |
18 | data-engineering-nd | 9 |
19 | sqlab | 10 |
20 | SQL-LLM-Distillation-GRPO | 4 |
21 | My-Projects | 1 |
22 | rfm_sales_analysis | 0 |
23 | udacity_bike_share_datalake_project | 0 |