Apache Arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics (by apache)

Apache Arrow Alternatives

Similar projects and alternatives to Apache Arrow

  1. Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. Redis

    Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs, Bitmaps.

  4. ClickHouse

    ClickHouse® is a real-time analytics database management system

  5. polars

    Dataframes powered by a multithreaded, vectorized query engine, written in Rust

  6. Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

  7. ruff

    109 Apache Arrow VS ruff

    An extremely fast Python linter and code formatter, written in Rust.

  8. db-benchmark

    reproducible benchmark of database-like ops

  9. simdjson

    Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

  10. Cap'n Proto

    Cap'n Proto serialization/RPC system - core tools and C++ library

  11. FlatBuffers

    FlatBuffers: Memory Efficient Serialization Library

  12. datafusion

    Apache DataFusion SQL Query Engine

  13. perspective

    A data visualization and analytics component, especially well-suited for large and/or streaming datasets.

  14. dragonfly

    A modern replacement for Redis and Memcached

  15. Panda3D

    Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU

  16. hudi

    Upserts, Deletes And Incremental Processing on Big Data.

  17. arrow-rs

    Official Rust implementation of Apache Arrow

  18. arquero

    10 Apache Arrow VS arquero

    Query processing and transformation of array-backed data tables.

  19. Airflow

    182 Apache Arrow VS Airflow

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

  20. viztracer

    A debugging and profiling tool that can trace and visualize python code execution

  21. h5py

    6 Apache Arrow VS h5py

    HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Apache Arrow alternative or higher similarity.

Apache Arrow discussion

Log in or Post with

Apache Arrow reviews and mentions

Posts with mentions or reviews of Apache Arrow. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-12-12.
  • Unlocking DuckDB from Anywhere - A Guide to Remote Access with Apache Arrow and Flight RPC (gRPC)
    4 projects | dev.to | 12 Dec 2024
    Apache Arrow : It contains a set of technologies that enable big data systems to process and move data fast
  • Using Polars in Rust for high-performance data analysis
    9 projects | dev.to | 30 Oct 2024
    One of the main selling points of Polars over similar solutions such as Pandas is performance. Polars is written in highly optimized Rust and uses the Apache Arrow container format.
  • Kotlin DataFrame ❤️ Arrow
    3 projects | dev.to | 10 Oct 2024
    Kotlin DataFrame v0.14 comes with improvements for reading Apache Arrow format, especially loading a DataFrame from any ArrowReader. This improvement can be used to easily load results from analytical databases (such as DuckDB, ClickHouse) directly into Kotlin DataFrame.
  • Random access string compression with FSST and Rust
    3 projects | news.ycombinator.com | 12 Sep 2024
  • Declarative Multi-Engine Data Stack with Ibis
    6 projects | dev.to | 17 Jul 2024
    Apache Arrow
  • Shades of Open Source - Understanding The Many Meanings of "Open"
    9 projects | dev.to | 15 Jun 2024
    It's this kind of certainty that underscores the vital role of the Apache Software Foundation (ASF). Many first encounter Apache through its pioneering project, the open-source web server framework that remains ubiquitous in web operations today. The ASF was initially created to hold the intellectual property and assets of the Apache project, and it has since evolved into a cornerstone for open-source projects worldwide. The ASF enforces strict standards for diverse contributions, independence, and activity in its projects, ensuring they can withstand the test of time as standards in software development. Many open-source projects strive to become Apache projects to gain the community credibility necessary for adoption as standard software building blocks, such as Apache Tomcat for Java web applications, Apache Arrow for in-memory data representation, and Apache Parquet for data file formatting, among others.
  • The Simdjson Library
    4 projects | news.ycombinator.com | 3 Jun 2024
  • Arrow Flight SQL in Apache Doris for 10X faster data transfer
    2 projects | dev.to | 12 May 2024
    Apache Doris 2.1 has a data transmission channel built on Arrow Flight SQL. (Apache Arrow is a software development platform designed for high data movement efficiency across systems and languages, and the Arrow format aims for high-performance, lossless data exchange.) It allows high-speed, large-scale data reading from Doris via SQL in various mainstream programming languages. For target clients that also support the Arrow format, the whole process will be free of serialization/deserialization, thus no performance loss. Another upside is, Arrow Flight can make full use of multi-node and multi-core architecture and implement parallel data transfer, which is another enabler of high data throughput.
  • How moving from Pandas to Polars made me write better code without writing better code
    2 projects | dev.to | 5 Mar 2024
    In comes Polars: a brand new dataframe library, or how the author Ritchie Vink describes it... a query engine with a dataframe frontend. Polars is built on top of the Arrow memory format and is written in Rust, which is a modern performant and memory-safe systems programming language similar to C/C++.
  • From slow to SIMD: A Go optimization story
    10 projects | news.ycombinator.com | 23 Jan 2024
    I learned yesterday about GoLang's assembler https://go.dev/doc/asm - after browsing how arrow is implemented for different languages (my experience is mainly C/C++) - https://github.com/apache/arrow/tree/main/go/arrow/math - there are bunch of .S ("asm" files) and I'm still not able to comprehend how these work exactly (I guess it'll take more reading) - it seems very peculiar.

    The last time I've used inlined assembly was back in Turbo/Borland Pascal, then bit in Visual Studio (32-bit), until they got disabled. Then did very little gcc with their more strict specification (while the former you had to know how the ABI worked, the latter too - but it was specced out).

    Anyway - I wasn't expecting to find this in "Go" :) But I guess you can always start with .go code then produce assembly (-S) then optimize it, or find/hire someone to do it.

  • A note from our sponsor - SaaSHub
    www.saashub.com | 20 Jan 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic Apache Arrow repo stats
83
14,854
9.9
2 days ago

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that C++ is
the 7th most popular programming language
based on number of references?