Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby. (by apache)

Arrow Alternatives

Similar projects and alternatives to arrow
  • per4m

    Profiling and tracing information for Python using viztracer and perf, the GIL exposed.

  • viztracer

    VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.

  • cpython

    The Python programming language

  • spark

    Apache Spark - A unified analytics engine for large-scale data processing

  • vaex

    Out-of-Core DataFrames for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀

  • cudf

    cuDF - GPU DataFrame Library

  • h5py

    HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.

  • spark-rapids

    Spark RAPIDS plugin - accelerate Apache Spark with GPUs

  • gil_load

    Utility for measuring the fraction of time the CPython GIL is held


    A utility for dumping per-thread statistics for CPython GIL using eBPF

NOTE: The number of mentions on this list indicates mentions on common posts. Hence, a higher number means a better alternative or higher similarity.


Posts where arrow has been mentioned. We have used some of these posts to build our list of alternatives and similar projects.


Basic arrow repo stats
5 days ago

apache/arrow is an open source project licensed under Apache License 2.0 which is an OSI approved license.