parquet-wasm vs arquero

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

parquet-wasm		arquero
	Project
6	Mentions	8
464	Stars	1,188
-	Growth	1.5%
9.0	Activity	4.6
3 days ago	Latest Commit	about 1 month ago
Rust	Language	JavaScript
Apache License 2.0	License	BSD 3-clause "New" or "Revised" License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

parquet-wasm

Posts with mentions or reviews of parquet-wasm. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-29.

FLaNK AI Weekly for 29 April 2024
44 projects | dev.to | 29 Apr 2024
Parquet-WASM: Rust-based WebAssembly bindings to read and write Parquet data
5 projects | news.ycombinator.com | 22 Apr 2024
Goodbye, Node.js Buffer
15 projects | news.ycombinator.com | 24 Oct 2023

nodejs-polars is node-specific and uses native FFI. polars can be compiled to Wasm but doesn't yet have a js API out of the box.
As for the fastest way to serialize data to Pandas data to the browser, you should use Parquet; it's the fastest to write on the Python side and read on the JS side, while also being compressed. See https://github.com/kylebarron/parquet-wasm (full disclosure, I wrote this)
Rust 1.63.0
14 projects | news.ycombinator.com | 11 Aug 2022

I'm building WebAssembly bindings to existing Rust libraries [0] and lower-dependency geospatial tools [1]. Rust makes it very easy to bind rust code to both WebAssembly and Python. And by avoiding some large C geospatial dependencies we can get reliable performance in both wasm and Python using the exact same codebase.
[0]: https://github.com/kylebarron/parquet-wasm
[1]: https://github.com/kylebarron/geopolars
Why isn’t there a decent file format for tabular data?
13 projects | news.ycombinator.com | 3 May 2022
Recommendations when publishing a WASM library
4 projects | news.ycombinator.com | 28 Mar 2022

Looks to be a great resource. I've been working on a WASM implementation of reading and writing Apache Parquet [0] and it's been difficult being new to WASM to find the best way of distributing the WASM that works on Node and through bundlers like Webpack.
[0]: https://github.com/kylebarron/parquet-wasm

arquero

Posts with mentions or reviews of arquero. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-07.

Show HN: Matrices – explore, visualize, and share large datasets
2 projects | news.ycombinator.com | 7 Dec 2023

Hey HN, I'm excited to share a new side project I've been working on.
The product is called Matrices. You can check it out here: https://matrices.com/.
With Matrices, you can *explore*, *visualize*, and *share* large (100k rows) datasets–all without code. Filter data down to just what you want, visualize it with built-in charts, and share your results with one click.
You can use it today (no login or waitlist or anything). Just copy and paste your data from a google sheet or CSV file.
It's hard to describe the feeling of "gliding over data" you get with Matrices, so I'd rather *show* you how it works instead. This 75s video will give you a sense of how it works: https://www.youtube.com/watch?v=Rrh9_I3Ux8E.
Data is stored locally in your browser until you publish it, though small sample does go to the OpenAI APIs for AI-assisted features.
I started building Matrices because I wanted a tool that made it easy to explore new datasets. When I'm first trying to dig into data, I'll have one question... that leads to another... that will invariably lead to five more questions. It's sort of a fractal process, and I couldn't find many good options that were fast, responsive, and visual.
I figured this crowd would be interested in tech stack as well, it's using arquero [1] bindings over apache arrow for in-memory analytics, and visx [2] for visualizations. I'd like to add duckdb-wasm support at some point to open up a wider set of databases. Data is serialized as parquet to save a bit on bandwidth + storage.
Give it a spin, and let me know what you think. This is my first 'serious frontend project' so I appreciate any and all feedback and bug reports. Feel free to comment here (I'll be around most of the day), or shoot me a note: [email protected]
[1]: https://uwdata.github.io/arquero/
Goodbye, Node.js Buffer
15 projects | news.ycombinator.com | 24 Oct 2023

https://github.com/uwdata/arquero
Arquero is a JavaScript library for query processing and transformation of array-backed data tables
1 project | /r/programming | 24 Jul 2022
Arquero – data tables wrangling in JavaScript
1 project | news.ycombinator.com | 22 Jul 2022
Hal9: Data Science with JavaScript
4 projects | /r/datascience | 9 Sep 2021

Transformations: We found out that JavaScript in combination with D3.js has a pretty decent set of data transformation functions; however, it comes nowhere near to Pandas or dplyr. We found out about Tidy.js quite early, loved it, and adopted it. The combination of Tidy.js and D3.js and Plot.js is absolutely amazing for visualizations and data wrangling with small datasets, say 10-100K rows. We were very happy with this for a while; however, once you move away from visualizations into real-world data analysis, we found out 100K rows restrictive, which gets worse when having 100 or 1K columns. So we switched gears and started using Arquero.js, which happens to be columnar and enabled us to process +1M rows in the browser, descent size for real-world data analysis.
Arquero – Query processing and transformation of array-backed data tables
1 project | news.ycombinator.com | 16 Feb 2021
Apache Arrow 3.0.0 Release
10 projects | news.ycombinator.com | 3 Feb 2021

Take a look at the arquero library from a research group at University of Washington (the same group that D3 came out of). https://github.com/uwdata/arquero

What are some alternatives?

When comparing parquet-wasm and arquero you can also consider the following projects:

datasette-stripe - A web SQL interface to your Stripe account using Datasette.

perspective - A data visualization and analytics component, especially well-suited for large and/or streaming datasets.

quickjs-emscripten - Safely execute untrusted Javascript in your Javascript, and execute synchronous code that uses async functions

Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

transmitic - Encrypted, peer to peer, file transfer program :: https://discord.gg/tRT3J6T :: https://www.reddit.com/r/transmitic/ :: https://twitter.com/transmitic

hal9ai - Hal9 — Data apps powered by code and LLMs [Moved to: https://github.com/hal9ai/hal9]

geopolars - Geospatial extensions for Polars

regression-js - Curve Fitting in JavaScript.

odiff - The fastest pixel-by-pixel image visual difference tool in the world.

arrow-julia - Official Julia implementation of Apache Arrow

rson - Rust Object Notation

cylon - Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.

parquet-wasm vs datasette-stripe arquero vs perspective parquet-wasm vs quickjs-emscripten arquero vs Apache Arrow parquet-wasm vs transmitic arquero vs hal9ai parquet-wasm vs geopolars arquero vs regression-js parquet-wasm vs odiff arquero vs arrow-julia parquet-wasm vs rson arquero vs cylon

Compare parquet-wasm vs arquero and see what are their differences.

parquet-wasm

arquero

parquet-wasm

arquero

What are some alternatives?