DataFusion Comet: Apache Spark Accelerator

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. datafusion-comet

    Apache DataFusion Comet Spark Accelerator

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. datafusion-ballista

    Apache DataFusion Ballista Distributed Query Engine

    But why. Just ditch Spark and use https://github.com/apache/datafusion-ballista directly.

  4. blaze

    Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core. (by kwai)

    I've been keeping an eye on these kinds of Spark accelerator libraries for a while now.

    How does it compare to Blaze[1] and Gluten[2]?

    I'm interested in running some benchmarks soon against all three for my project to see how they all go.

    [1] https://github.com/kwai/blaze

  5. incubator-gluten

    Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • GlareDB: An open source SQL database to query and analyze distributed data

    4 projects | /r/dataengineering | 8 Jun 2023
  • GlueSQL: A SQL database engine written as a library in Rust

    4 projects | news.ycombinator.com | 22 Oct 2022
  • Apache DataFusion

    3 projects | news.ycombinator.com | 12 Jan 2025
  • Show HN: TonboLite – Scale SQLite with S3, Minimize ETL

    2 projects | news.ycombinator.com | 7 Jan 2025
  • Building a distributed log using S3 (under 150 lines of Go)

    4 projects | news.ycombinator.com | 1 Dec 2024

Did you know that Rust is
the 5th most popular programming language
based on number of references?