An SQL Solution for Jupyter

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • Jupyter Scala

    A Scala kernel for Jupyter

  • We have used https://almond.sh/ to create a Spark SQL interpreter using Jupyter Notebooks - plus a whole lot more which you can see here: https://arc.tripl.ai/tutorial

    After seeing many companies writing ETL using code we decided it was too hard to manage at scale so provided this abstraction layer - which is heavily centered around expressing business logic in SQL - to standardise development (JupyterLab) and allow rapid deployments.

  • xeus-tidb

    Discontinued A Jupyter kernel for TiDB

  • The Github support for notebooks is so nice (was linked from the example pic caption: https://github.com/wangfenjin/xeus-tidb/blob/develop/example...)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • xeus-sql

    Jupyter kernel for SQL databases

  • soci

    Official repository of the SOCI - The C++ Database Access Library

  • but we offer full support to SOCI meaning if these folks: https://github.com/SOCI/soci support it and the dependencies exist and work for 32bit, than yes.

  • sparkmagic

    Jupyter magics and kernels for working with remote Spark clusters

  • Jupyter would be even better if it supported the seamless combination of Python and SQL code cells.

    My notebook code typically involves a data prep stage with querying a SQL database, then downloading into Python for more complex analysis, ML modelling, integration with external data sources, etc. So the notebook has a Python kernel with SQL usually as embedded """-quoted strings.

    Does anyone have a solution to treating selected code cells as SQL - with SQL highlighting and tooltips - exposed as string variables to the Python code?

    Sparkmagic [1] does part of this for Python/SQL/Spark interoperability, but as far as I recall, doesn't support SQL syntax highlighting.

    [1] https://github.com/jupyter-incubator/sparkmagic

  • allthekernels

    A multiplexer kernel for Jupyter

  • there have been some efforts in the past to do that: https://github.com/minrk/allthekernels

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Ыelf-hosted alternative to Looker

    1 project | /r/selfhosted | 27 Sep 2021
  • I've built an Open Source alternative to Looker using Hasura GraphQL engine, need your feedback

    1 project | /r/SideProject | 27 Sep 2021
  • Open Source alternative to Looker built with Hasura GraphQL Engine

    1 project | /r/graphql | 27 Sep 2021
  • Show HN: Low-code metrics store and Open Source alternative to Looker

    1 project | news.ycombinator.com | 27 Sep 2021
  • What do you use webscraping for?

    1 project | /r/webscraping | 12 Sep 2021