Show HN: SQLFrame – I ran PySpark without Spark on a SQL database

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
  1. sqlframe

    Turning PySpark Into a Universal DataFrame API

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. snowpark-python

    Snowflake Snowpark Python API

    This is cool and in my mind super useful for migrations.

    It seems the main benefit of using something like that in daily life is that it's more convenient to generate complex SQL statements (like pivoting a table with a lot of columns).

    However, I never really liked the PySpark dataframe api and looking at the code examples, SQL has the same visual complexity.

    Snowflake has built something similar (just for Snowflake) SnowPark [1]. Here one promoted benefit was that you could also inject native Python function and "extend" the SQL dialect. However, I don't think it really took off.

    [1] https://github.com/snowflakedb/snowpark-python

  4. ibis

    the portable Python dataframe library

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • ConnectorX: Accelerating Data Loading From Databases to Dataframes

    1 project | news.ycombinator.com | 17 Mar 2025
  • Polars

    11 projects | news.ycombinator.com | 8 Jan 2024
  • A LLM+OLAP Solution

    4 projects | news.ycombinator.com | 11 Sep 2023
  • I used multiprocessing and multithreading at the same time to drop the execution time of my code from 155+ seconds to just over 2+ seconds

    1 project | /r/Python | 29 May 2023
  • Pure Python Distributed SQL Engine

    9 projects | news.ycombinator.com | 30 Dec 2022

Did you know that Python is
the 2nd most popular programming language
based on number of references?