perspective
Apache Arrow
Our great sponsors
perspective | Apache Arrow | |
---|---|---|
32 | 57 | |
5,203 | 11,042 | |
1.4% | 1.9% | |
9.6 | 10.0 | |
6 days ago | 5 days ago | |
C++ | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
perspective
-
Ask HN: Who is hiring? (February 2023)
We're looking for senior product managers and engineers of all experience levels to build the next generation of collaborative data visualization. At the Prospective Co., you'll contribute to our existing open-source project as well as help design our enterprise offering.
https://perspective.finos.org/
We're looking for any of:
- Familiarity with WebAssembly, data visualization, WebGL/OpenGL, data science, Jupyter/notebook, web/desktop/mobile UI development, compiler/language or database design, finance services.
- Primary stack is Rust (targeting WebAssembly). JavaScript, C++ and Python are a big plus.
- We <3 GitHub contributors - opt to discuss your GitHub work in lieu of a technical interview.
Contact [email protected]
- NYC Slice
- Data Visualization Framework for React, Angular, Svelte, TypeScript, JavaScript
- Nocodb: Turns Any MySQL, Postgres, SQLite into a Spreadsheet with REST APIs
- Ask HN: Who is hiring? (October 2022)
- Ask HN: Who is hiring? (September 2022)
-
Official /r/rust "Who's Hiring" thread for job-seekers and job-offerers [Rust 1.63]
DESCRIPTION: We're looking for senior product managers and engineers of all experience levels to build the next generation of collaborative data visualization. At the Prospective Co., you'll contribute to our existing open-source project (Perspective https://perspective.finos.org/) as well as help design our enterprise offering. We're looking for any of: - Familiarity with WebAssembly, data visualization, WebGL/OpenGL, data science, Jupyter/notebook, web/desktop/mobile UI development, compiler/language or database design, finance services. - Primary stack is Rust (targeting WebAssembly, especially Yew). JavaScript, C++ and Python are a big plus. - We <3 GitHub contributors - opt to discuss your GitHub work in lieu of a technical interview.
- Show HN: Grid.js – Advanced table library that works everywhere (2020)
- Memray is a memory profiler for Python by Bloomberg
-
Is React performant enough for trading applications?
Thank you. I saw this, https://github.com/finos/perspective
Apache Arrow
-
Tried polars in Ruby
Several members of the Ruby community have been deeply involved in the development of Apache Arrow.
-
IPC communication between rust, c++, and python
In that case, why not use polars, which supports apache arrow format which supports C, C++, Rust, Python and supports zero-copy read.
-
Introducing ArrowJS • Reactivity without the framework
I think the naming will likely cause some confusion with apache arrow. My initial thoughts when reading "Introducing ArrowJS" was a new port of the apache arrow spec.
-
Java Serialization with Protocol Buffers
The information can be stored in a database or as files, serialized in a standard format and with a schema agreed with your Data Engineering team. Depending on your information and requirements, it can be as simple as CSV, XML or JSON, or Big Data formats such as Parquet, Avro, ORC, Arrow, or message serialization formats like Protocol Buffers, FlatBuffers, MessagePack, Thrift, or Cap'n Proto.
-
GlueSQL: A SQL database engine written as a library in Rust
Just another embedded SQL engine.
There are SQLite(OLTP), DuckDB(OLAP) and some engine-based project like mentioned Apache Arrow(https://arrow.apache.org/)(OLAP): Apache Arrow has many language implementations, some do not include the query engine(for example, Rust implementation, which depends on the DataFusion for more SQL-like analytics) in its own repo, but other do include(for example, C++).
There is a comprehensive benchmark by ClickHouse for OLAP but including kinds of embedding engines: https://benchmark.clickhouse.com/
The more interesting is that, in fact, we have not an embedded HTAP engine. One of my database products already implements 3/4 HTAP at the engine layer, but unfortunately it's still just a free software, not an open source implementation.
- Looking for a cross-language communication framework between Rust and Python
-
C++ Jobs - Q4 2022
Apache Arrow
-
New Pandas-for-Haskell data frame library: Name suggestions
This is a meta-request for the library, but imo it would be really awesome if it used a data structure compatible with Arrow: https://arrow.apache.org/
-
How to Deploy ML Models Using Gravity AI and Meadowrun
As a bit of an aside, you could imagine a way to get the best of both worlds with an extension to Docker that would allow you to publish a container that exposes a Python API, so that someone could call sentiment = call_container_api(image="huggingface/transformers", "my input text") directly from their python code. This would effectively be a remote procedure call into a container that is not running as a service but instead spun up just for the purpose of executing a function on-demand. This feels like a really heavyweight approach to solving dependency hell, but if your libraries are using a cross-platform memory format (hello Apache Arrow!) under the covers, you could imagine doing some fun tricks like giving the container a read-only view into the caller’s memory space to reduce the overhead. It’s a bit implausible, but sometimes it’s helpful to sketch out these ideas to clarify the tradeoffs we’re making with the more practical bits of technology we have available.
-
Scala needs a good, dependency-free DataFrame library
I assume you mean to use Apache arrow rather than scala Arrow?
What are some alternatives?
h5py - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
Airflow - Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
polars - Fast multi-threaded, hybrid-out-of-core DataFrame library in Rust | Python | Node.js
ClickHouse - ClickHouse® is a free analytics DBMS for big data
FlatBuffers - FlatBuffers: Memory Efficient Serialization Library
duckdb_and_r - My thoughts and examples on DuckDB and R
ta-lib-python - Python wrapper for TA-Lib (http://ta-lib.org/).
Apache HBase - Apache HBase
beam - Apache Beam is a unified programming model for Batch and Streaming data processing.
arquero - Query processing and transformation of array-backed data tables.