Apache Arrow Alternatives
Similar projects and alternatives to Apache Arrow
-
Apache Spark
Apache Spark - A unified analytics engine for large-scale data processing
-
polars
Fast multi-threaded, hybrid-out-of-core DataFrame library in Rust | Python | Node.js
-
InfluxDB
Build time-series-based applications quickly and at scale. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.
-
Airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
h5py
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
-
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
-
Cap'n Proto
Cap'n Proto serialization/RPC system - core tools and C++ library
-
perspective
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
-
per4m
Profiling and tracing information for Python using viztracer and perf, the GIL exposed.
-
Redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kinds of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs, Bitmaps.
-
viztracer
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
Apache Arrow reviews and mentions
-
Tried polars in Ruby
Several members of the Ruby community have been deeply involved in the development of Apache Arrow.
-
IPC communication between rust, c++, and python
In that case, why not use polars? It supports the Apache Arrow format, which has implementations for C, C++, Rust, and Python and supports zero-copy reads.
-
Introducing ArrowJS • Reactivity without the framework
I think the naming will likely cause some confusion with Apache Arrow. My initial thought when reading "Introducing ArrowJS" was that it was a new port of the Apache Arrow spec.
-
Java Serialization with Protocol Buffers
The information can be stored in a database or as files, serialized in a standard format and with a schema agreed with your Data Engineering team. Depending on your information and requirements, it can be as simple as CSV, XML or JSON, or Big Data formats such as Parquet, Avro, ORC, Arrow, or message serialization formats like Protocol Buffers, FlatBuffers, MessagePack, Thrift, or Cap'n Proto.
-
GlueSQL: A SQL database engine written as a library in Rust
Just another embedded SQL engine.
There are SQLite (OLTP), DuckDB (OLAP), and some engine-based projects such as the mentioned Apache Arrow (https://arrow.apache.org/) (OLAP). Apache Arrow has many language implementations; some do not include a query engine in their own repo (for example, the Rust implementation, which depends on DataFusion for SQL-like analytics), while others do (for example, C++).
There is a comprehensive OLAP benchmark by ClickHouse that also covers several embedded engines: https://benchmark.clickhouse.com/
More interesting is that we do not, in fact, have an embedded HTAP engine. One of my database products already implements 3/4 of HTAP at the engine layer, but unfortunately it is still just free software, not an open source implementation.
-
Looking for a cross-language communication framework between Rust and Python
-
C++ Jobs - Q4 2022
Apache Arrow
-
New Pandas-for-Haskell data frame library: Name suggestions
This is a meta-request for the library, but imo it would be really awesome if it used a data structure compatible with Arrow: https://arrow.apache.org/
-
How to Deploy ML Models Using Gravity AI and Meadowrun
As a bit of an aside, you could imagine a way to get the best of both worlds with an extension to Docker that would allow you to publish a container that exposes a Python API, so that someone could call sentiment = call_container_api(image="huggingface/transformers", "my input text") directly from their python code. This would effectively be a remote procedure call into a container that is not running as a service but instead spun up just for the purpose of executing a function on-demand. This feels like a really heavyweight approach to solving dependency hell, but if your libraries are using a cross-platform memory format (hello Apache Arrow!) under the covers, you could imagine doing some fun tricks like giving the container a read-only view into the caller’s memory space to reduce the overhead. It’s a bit implausible, but sometimes it’s helpful to sketch out these ideas to clarify the tradeoffs we’re making with the more practical bits of technology we have available.
-
Scala needs a good, dependency-free DataFrame library
I assume you mean to use Apache Arrow rather than Scala Arrow?
Stats
apache/arrow is an open source project licensed under Apache License 2.0 which is an OSI approved license.