arquero
perspective
Metric | arquero | perspective
---|---|---
Mentions | 8 | 43
Stars | 1,170 | 7,384
Growth | 2.7% | 3.7%
Activity | 5.1 | 9.4
Last commit | 13 days ago | 7 days ago
Language | JavaScript | C++
License | BSD 3-clause "New" or "Revised" License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
arquero
- Show HN: Matrices – explore, visualize, and share large datasets
Hey HN, I'm excited to share a new side project I've been working on.
The product is called Matrices. You can check it out here: https://matrices.com/.
With Matrices, you can *explore*, *visualize*, and *share* large (100k rows) datasets–all without code. Filter data down to just what you want, visualize it with built-in charts, and share your results with one click.
You can use it today (no login or waitlist or anything). Just copy and paste your data from a google sheet or CSV file.
It's hard to describe the feeling of "gliding over data" you get with Matrices, so I'd rather *show* you how it works instead. This 75s video will give you a sense of how it works: https://www.youtube.com/watch?v=Rrh9_I3Ux8E.
Data is stored locally in your browser until you publish it, though a small sample does go to the OpenAI APIs for AI-assisted features.
I started building Matrices because I wanted a tool that made it easy to explore new datasets. When I'm first trying to dig into data, I'll have one question... that leads to another... that will invariably lead to five more questions. It's sort of a fractal process, and I couldn't find many good options that were fast, responsive, and visual.
I figured this crowd would be interested in the tech stack as well: it's using arquero [1] bindings over Apache Arrow for in-memory analytics, and visx [2] for visualizations. I'd like to add duckdb-wasm support at some point to open up a wider set of databases. Data is serialized as Parquet to save a bit on bandwidth + storage.
Give it a spin, and let me know what you think. This is my first 'serious frontend project' so I appreciate any and all feedback and bug reports. Feel free to comment here (I'll be around most of the day), or shoot me a note: [email protected]
- Goodbye, Node.js Buffer
- Hal9: Data Science with JavaScript
Transformations: We found that JavaScript in combination with D3.js has a pretty decent set of data transformation functions; however, it comes nowhere near Pandas or dplyr. We found Tidy.js quite early, loved it, and adopted it. The combination of Tidy.js, D3.js, and Plot.js is absolutely amazing for visualizations and data wrangling with small datasets, say 10-100K rows. We were very happy with this for a while; however, once we moved away from visualizations into real-world data analysis, we found 100K rows restrictive, and it gets worse with 100 or 1K columns. So we switched gears and started using Arquero.js, which happens to be columnar and enabled us to process 1M+ rows in the browser, a decent size for real-world data analysis.
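To illustrate why a columnar layout helps at this scale, here is a minimal sketch that contrasts a row-oriented table (one object per record) with a column-oriented one (one typed array per field). It does not use Arquero or reproduce its internals; the field names (`id`, `price`, `qty`) and the helper functions are hypothetical and exist only for this example.

```javascript
// Row-oriented layout: one JavaScript object per record.
function makeRows(n) {
  const rows = [];
  for (let i = 0; i < n; i++) {
    rows.push({ id: i, price: (i % 100) + 0.5, qty: (i % 7) + 1 });
  }
  return rows;
}

// Column-oriented layout: one typed array per field (hypothetical schema).
function makeColumns(n) {
  const id = new Int32Array(n);
  const price = new Float64Array(n);
  const qty = new Int32Array(n);
  for (let i = 0; i < n; i++) {
    id[i] = i;
    price[i] = (i % 100) + 0.5;
    qty[i] = (i % 7) + 1;
  }
  return { length: n, id, price, qty };
}

// Sum price * qty over records with qty >= minQty, row version.
function revenueFromRows(rows, minQty) {
  let total = 0;
  for (const row of rows) {
    if (row.qty >= minQty) total += row.price * row.qty;
  }
  return total;
}

// Same computation, column version: only the two columns that the
// query needs are read, and each one is a contiguous numeric buffer.
function revenueFromColumns(cols, minQty) {
  const { price, qty, length } = cols;
  let total = 0;
  for (let i = 0; i < length; i++) {
    if (qty[i] >= minQty) total += price[i] * qty[i];
  }
  return total;
}
```

Both functions return the same number for the same data. The columnar version avoids allocating one object per row, which is typically what makes million-row tables practical to hold in browser memory.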
- Apache Arrow 3.0.0 Release
Take a look at the arquero library from a research group at the University of Washington (the same group that D3 came out of). https://github.com/uwdata/arquero
perspective
- Show HN: WhatTheDuck – open-source, in-browser SQL on CSV files
SQL workbench also uses https://perspective.finos.org/ for tables. It's a WASM table library which pairs nicely with duckdb and works well with large tables.
- React Spreadsheet 2 – Your Own Google Sheets
Yes. We are working on adding support for aggregation and pivoting using https://github.com/finos/perspective
- Show HN: DataSheetGrid, an Airtable-like React component
I haven't looked extensively at react-datasheet. It looks like it is trying to build more of a full product than the other data tables.
I have used ag-grid extensively; it's an impressive product. Some pieces are a little awkward to use, particularly auto-sizing. But generally ag-grid has thought of most functionality and has a solution. The creator of ag-grid had a great interview on JavaScript Jabber [1].
The other serious data table component that I have seen is FinOS Perspective [2]. This is extremely high performance, also more specialized and probably harder to customize. I think Perspective renders to a canvas element from Rust/C++ compiled to WASM (not 100% sure). It is also made for streaming updates.
AG-Grid supports streaming updates... but only in the commercial version.
Eventually the data model for these types of tables becomes tricky. I will be investigating parquet-wasm for my use case. Hit me up if you want to collaborate.
- ChDB: Embedded OLAP SQL Engine Powered by ClickHouse
Something like https://github.com/finos/perspective ? We use an OLAP(-y) WASM engine to provide query-ability to our data visualization tool, and doing the calculations in the browser is cheaper and simpler than a server-side database for datasets that fit in browser memory.
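As a rough illustration of the kind of OLAP-style calculation that can run entirely in the browser, the sketch below groups and pivots a small in-memory dataset in plain JavaScript. It is not Perspective's API or engine; the `groupSum` and `pivotSum` helpers and the sample `trades` data are hypothetical.

```javascript
// Group rows by one key and sum a numeric field.
function groupSum(rows, key, field) {
  const totals = new Map();
  for (const row of rows) {
    totals.set(row[key], (totals.get(row[key]) ?? 0) + row[field]);
  }
  return totals;
}

// Pivot: one outer entry per rowKey value, one inner entry per colKey
// value, each cell holding the sum of `field`.
function pivotSum(rows, rowKey, colKey, field) {
  const table = new Map();
  for (const row of rows) {
    let cells = table.get(row[rowKey]);
    if (!cells) {
      cells = new Map();
      table.set(row[rowKey], cells);
    }
    cells.set(row[colKey], (cells.get(row[colKey]) ?? 0) + row[field]);
  }
  return table;
}

// Hypothetical sample data, small enough to fit in browser memory.
const trades = [
  { desk: "NYC", symbol: "AAPL", qty: 10 },
  { desk: "NYC", symbol: "MSFT", qty: 5 },
  { desk: "LDN", symbol: "AAPL", qty: 7 },
  { desk: "NYC", symbol: "AAPL", qty: 3 },
];

const byDesk = groupSum(trades, "desk", "qty");
const byDeskAndSymbol = pivotSum(trades, "desk", "symbol", "qty");
```

Here `byDesk.get("NYC")` is 18 and `byDeskAndSymbol.get("NYC").get("AAPL")` is 13. No server round trip is involved, which is the cost and simplicity argument made above for datasets that fit in memory.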
- Show HN: Udsv.js – A faster CSV parser in 5KB (min)
- Perspective 2.0, Open Source WebAssembly-Powered BI
It's an open source project. You could create an issue on their GitHub repo, or better yet, create a PR and reference this existing issue:
- Ask HN: Who is hiring? (February 2023)
We're looking for senior product managers and engineers of all experience levels to build the next generation of collaborative data visualization. At the Prospective Co., you'll contribute to our existing open-source project as well as help design our enterprise offering.
https://perspective.finos.org/
We're looking for any of:
- Familiarity with WebAssembly, data visualization, WebGL/OpenGL, data science, Jupyter/notebook, web/desktop/mobile UI development, compiler/language or database design, financial services.
- Primary stack is Rust (targeting WebAssembly). JavaScript, C++ and Python are a big plus.
- We <3 GitHub contributors - opt to discuss your GitHub work in lieu of a technical interview.
Contact [email protected]
- NYC Slice
- Data Visualization Framework for React, Angular, Svelte, TypeScript, JavaScript
What are some alternatives?
ag-Grid - The best JavaScript Data Table for building Enterprise Applications. Supports React / Angular / Vue / Plain JavaScript.
nocodb - 🔥 🔥 🔥 Open Source Airtable Alternative
datapane - Build and share data reports in 100% Python
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
hal9ai - Hal9 — Data apps powered by code and LLMs [Moved to: https://github.com/hal9ai/hal9]
ClickHouse - ClickHouse® is a free analytics DBMS for big data
regression-js - Curve Fitting in JavaScript.
SandDance - Visually explore, understand, and present your data.
arrow-julia - Official Julia implementation of Apache Arrow
magic-trace - magic-trace collects and displays high-resolution traces of what a process is doing
vega-loader-arrow - Data loader for the Apache Arrow format.