Jupyter Notebook SQL

Open-source Jupyter Notebook projects categorized as SQL

Top 23 Jupyter Notebook SQL Projects

  1. data-engineer-handbook

    This is a repo with links to everything you'd ever want to learn about data engineering

    Project mention: Data-engineer-handbook: everything to learn about data engineering | news.ycombinator.com | 2024-12-03

    This thing points to some sort of github metrics dashboard.

    The actual handbook is at: https://github.com/DataExpert-io/data-engineer-handbook

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. The-Complete-FAANG-Preparation

    Dive into this repository, a comprehensive resource covering Data Structures, Algorithms, 450 DSA by Love Babbar, Striver DSA sheet, Apna College DSA Sheet, and FAANG Questions! 🚀 That's not all! We've got Technical Subjects like Operating Systems, DBMS, SQL, Computer Networks, and Object-Oriented Programming, all waiting for you.

  4. beakerx

    Beaker Extensions for Jupyter Notebook

  5. logica

    Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.

    Project mention: Logica – declarative logic programming language for data | news.ycombinator.com | 2024-11-16
  6. tigertoolbox

    Toolbox repository for Tiger team

    Project mention: SQL Server – Query Performance – Database Maintenance can Help | dev.to | 2025-01-14
  7. bigquery-utils

    Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.

    Project mention: PostgreSQL Maximalism | dev.to | 2025-05-28

    Alternatives to: DuckDB, Apache Cassandra, Amazon RedShift, Google BigQuery, Snowflake, InfluxDB, Prometheus, Amazon Timestream

  8. spyql

    Query data on the command line with SQL-like SELECTs powered by Python expressions

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. SQL-for-Data-Analytics

    Perform fast and efficient data analysis with the power of SQL

  11. RasgoQL

    Write python locally, execute SQL in your data warehouse

  12. snowflake-demo-notebooks

    Collection of Snowflake Notebook demos, tutorials, and examples

    Project mention: All Data and AI Weekly #182 - 24-March-2025 | dev.to | 2025-03-24
  13. lang2sql

    A tutorial for setting an SQL code generator with the OpenAI API

  14. openbrewerydb

    🍻 An open-source dataset of breweries, cideries, brewpubs, and bottleshops.

  15. universql

    Pushdown compute from Snowflake to DuckDB running on your infrastructure

    Project mention: Test Postgres in Python Like SQLite | news.ycombinator.com | 2025-06-05

    I'm developing https://github.com/buremba/universql for this use-case, though DuckDB is used under the hood.

  16. data-science-notes

    Notes of IBM Data Science Professional Certificate Courses on Coursera

  17. spaces-notebooks

    Collection of notebooks for use with SingleStoreDB

  18. vulcan-sql-examples

    Curated VulcanSQL show cases

  19. Data-Engineering-Portfolio

    I'm learning how to build data pipelines to work with large datasets. (:

  20. data-engineering-nd

    Projects of the Udacity Data Engineering Nanodegree Program.

  21. sqlab

    SQL Adventure Builder: transform a dataset and a collection of SQL exercises into a self-contained database

    Project mention: Show HN: SQL Noir – Learn SQL by solving crimes | news.ycombinator.com | 2025-02-14

    I myself am working on SQLab, a SQL game engine that allows you to augment an arbitrary base with exercises on that base to produce directed, standalone adventures: https://github.com/laowantong/sqlab.

  22. SQL-LLM-Distillation-GRPO

    Inspired by mathematical reasoning models like DeepSeekMath, this framework applies CoT to SQL generation and fine-tunes distilled models using GRPO to enhance both accuracy and interpretability.

    Project mention: Structured Reasoning Distillation in SQL via CoT and Reinforcement Learning with GRPO | dev.to | 2025-05-22

    Repo: SQL-LLM-Distillation-GRPO Dataset: sql-distill-llama-3-1-70b-instruct-reasoning Model Fine-Tuned: sql-llama3.2-3b-it-reasoning

  23. My-Projects

    My Projects

  24. rfm_sales_analysis

    This project offers a data-driven approach to analyze sales performance using RFM (Recency, Frequency, Monetary Value) analysis, a powerful technique for gaining insights into customer behavior and optimizing marketing strategies.

  25. udacity_bike_share_datalake_project

    Azure Data Lake

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook SQL discussion

Log in or Post with

Jupyter Notebook SQL related posts

  • SQL Server – Query Performance – Database Maintenance can Help

    1 project | dev.to | 14 Jan 2025
  • Logica – declarative logic programming language for data

    4 projects | news.ycombinator.com | 16 Nov 2024
  • Game Programming in Prolog

    5 projects | news.ycombinator.com | 10 Oct 2024
  • Logica

    1 project | news.ycombinator.com | 21 Jan 2024
  • Performance Tuning Production

    1 project | /r/SQLServer | 28 Jun 2023
  • New welcome page for Logica language

    1 project | news.ycombinator.com | 23 May 2023
  • SQL Server, SSRS, "login failed for anonymous" and Kerberos Config Mgr

    1 project | /r/sysadmin | 28 Apr 2023
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 10 Jul 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source SQL projects in Jupyter Notebook? This list will help you:

# Project Stars
1 data-engineer-handbook 34,992
2 The-Complete-FAANG-Preparation 11,288
3 beakerx 2,817
4 logica 1,983
5 tigertoolbox 1,542
6 bigquery-utils 1,219
7 spyql 930
8 SQL-for-Data-Analytics 287
9 RasgoQL 270
10 snowflake-demo-notebooks 252
11 lang2sql 246
12 openbrewerydb 193
13 universql 185
14 data-science-notes 68
15 spaces-notebooks 24
16 vulcan-sql-examples 22
17 Data-Engineering-Portfolio 14
18 data-engineering-nd 9
19 sqlab 10
20 SQL-LLM-Distillation-GRPO 4
21 My-Projects 1
22 rfm_sales_analysis 0
23 udacity_bike_share_datalake_project 0

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Jupyter Notebook is
the 13th most popular programming language
based on number of references?