spark-sql

Open-source projects categorized as spark-sql

Top 13 spark-sql Open-Source Projects

  • Redash

    Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

  • Project mention: Redash: Connect to data source, easily visualize, dashboard and share your data | news.ycombinator.com | 2024-03-20
  • spark

    .NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers. (by dotnet)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • kyuubi

    Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

  • Jupyter Scala

    A Scala kernel for Jupyter

  • LearningSparkV2

    This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

  • incubator-gluten

    Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

  • Project mention: A glimpse into the future of data processing infrastructure. | dev.to | 2024-05-02

    When I first learned about the Gluten project from Intel, I thought Databricks was going to be in trouble.

  • jupysql

    Better SQL in Jupyter. 📊

  • Project mention: Show HN: JupySQL – a SQL client for Jupyter (ipython-SQL successor) | news.ycombinator.com | 2023-12-06

    Hey, HN community!

    We're stoked to launch JupySQL today! JupySQL is an open-source library that brings a modern SQL experience to Jupyter. JupySQL is compatible with all major databases, such as Snowflake, Redshift, PostgreSQL, MySQL, MariaDB, DuckDB, SQL Server, Clickhouse, Trino, and more!

    To get started, check out our tutorial: https://jupysql.ploomber.io/en/latest/quick-start.html

    SQL is the defacto language for data analysis; however, analysis often requires a mix of SQL and Python. JupySQL bridges this gap, allowing users to execute SQL queries seamlessly in Jupyter and continue their analysis in Python. Add %%sql to the top of your cell and start writing SQL.

    Here are some of JupySQL's main features:

    - Syntax highlighting

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • ngods-stocks

    New Generation Opensource Data Stack Demo

  • cuelake

    Use SQL to build ELT pipelines on a data lakehouse.

  • qbeast-spark

    Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

  • opaque-sql

    An encrypted data analytics platform

  • Sparkplug

    Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌

  • iceberg-intro-workshop

    Hands-on workshop with Apache Iceberg

  • Project mention: FLaNK Stack Weekly for 13 November 2023 | dev.to | 2023-11-13
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

spark-sql related posts

  • A glimpse into the future of data processing infrastructure.

    1 project | dev.to | 2 May 2024

Index

What are some of the best open-source spark-sql projects? This list will help you:

Project Stars
1 Redash 24,994
2 spark 1,999
3 kyuubi 1,941
4 Jupyter Scala 1,564
5 LearningSparkV2 1,095
6 incubator-gluten 988
7 jupysql 605
8 ngods-stocks 373
9 cuelake 284
10 qbeast-spark 192
11 opaque-sql 176
12 Sparkplug 28
13 iceberg-intro-workshop 10

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com