duckdb-wasm
graphic-walker
duckdb-wasm | graphic-walker | |
---|---|---|
11 | 20 | |
924 | 2,235 | |
5.2% | 1.8% | |
9.5 | 9.4 | |
4 days ago | 8 days ago | |
C++ | TypeScript | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
duckdb-wasm
-
Parquet-WASM: Rust-based WebAssembly bindings to read and write Parquet data
i think duckdb-wasm is closer to 6MB over wire, but ~36MB once decompressed. (see net panel when loading https://shell.duckdb.org/)
the decompressed size should be okay since it's not the same as parsing and JITing 36MB of JS.
- 42.parquet – A Zip Bomb for the Big Data Age
-
Show HN: Open-source, browser-local data exploration using DuckDB-WASM and PRQL
Hey HN! We’ve built Pretzel, an open-source data exploration and visualization tool that runs fully in the browser and can handle large files (200 MB CSV on my 8gb MacBook air is snappy). It’s also reactive - so if, for example, you change a filter, all the data transform blocks after it re-evaluate automatically. You can try it here: https://pretzelai.github.io/ (static hosted webpage) or see a demo video here: https://www.youtube.com/watch?v=73wNEun_L7w
You can play with the demo CSV that’s pre-loaded (GitHub data of text-editor adjacent projects) or upload your own CSV/XLSX file. The tool runs fully in-browser—you can disconnect from the internet once the website loads—so feel free to use sensitive data if you like.
Here’s how it works: You upload a CSV file and then, explore your data as a series of successive data transforms and plots. For example, you might: (1) Remove some columns; (2) Apply some filters (remove nulls, remove outliers, restrict time range etc); (3) Do a pivot (i.e, a group-by but fancier); (4) Plot a chart; (5) Download the chart and the the transformed data. See screenshot: https://imgur.com/a/qO4yURI
In the UI, each transform step appears as a “Block”. You can always see the result of the full transform in a table on the right. The transform blocks are editable - for instance in the example above, you can go to step 2, change some filters and the reactivity will take care of re-computing all the cells that follow, including the charts.
We wanted Pretzel to run locally in the browser and be extremely performant on large files. So, we parse CSVs with the fastest CSV parser (uDSV: https://github.com/leeoniya/uDSV) and use DuckDB-Wasm (https://github.com/duckdb/duckdb-wasm) to do all the heavy lifting of processing the data. We also wanted to allow for chained data transformations where each new block operates on the result of the previous block. For this, we’re using PRQL (https://prql-lang.org/) since it maps 1-1 with chained data transform blocks - each block maps to a chunk of PRQL which when combined, describes the full data transform chain. (PRQL doesn’t support DuckDB’s Pivot statement though so we had to make some CTE based hacks).
There’s also an AI block: This is the only (optional) feature that requires an internet connection but we’re working on adding local model support via Ollama. For now, you can use your own OpenAI API key or use an AI server we provide (GPT4 proxy; it’s loaded with a few credits), specify a transform in plain english and get back the SQL for the transform which you can edit.
Our roadmap includes allowing API calls to create new columns; support for an SQL block with nice autocomplete features, and a Python block (using Pyodide to run Python in the browser) on the results of the data transforms, much like a jupyter notebook.
There’s two of us and we’ve only spent about a week coding this and fixing major bugs so there are still some bugs to iron out. We’d love for you to try this and to get your feedback!
- DuckDB-WASM: WebAssembly Version of DuckDB
-
Show HN: DuckDB-WASM, execute queries in a browser, and share them as links
Amazing, I was eagerly waiting for this one. Loading extensions in previous DuckDB-WASM releases didn't work seamlessly. Looks like now it's the case :D
ref: https://github.com/duckdb/duckdb-wasm/issues/1542#issuecomme...
Thanks!!
-
DuckDB 0.9.0
Btw, it's already happening:
Go to https://shell.duckdb.org, and type
-
Does anyone else hate Pandas?
I like Pandas, but you will love duckdb, which is solving this exact problem: https://duckdb.org/; https://shell.duckdb.org/
-
[Question] Using DuckDB to connect to (external/cloud) Postgres DB
There's also https://shell.duckdb.org/ for playing around.
- Ask HN: What tech is under the radar with all attention on ChatGPT etc.
-
My first Rust project: Xlsx-wasm-parser. A WebAssembly-wrapper around the Calamine crate to bring Blazingly Fast Excel deserialization to the Browser and NodeJS.
I know xls != csv, but would be cool to compare against https://github.com/duckdb/duckdb-wasm as well
graphic-walker
-
Show HN: Open-source, browser-local data exploration using DuckDB-WASM and PRQL
[2] https://github.com/Kanaries/graphic-walker/issues/330
-
Show HN: GPT and tableau-style interface in R for data visualization
GWalkR is an open-source R library that allows you to turn your data frame into a tableau style user interface for data exploration and visualization. It also allows you to analysis your data with natural language questions.
GWalkR is the R binding of graphic-walker: https://github.com/Kanaries/graphic-walker
- FLaNK Stack for 4th of July
-
Observable Plot: The JavaScript library for exploratory data visualization
Big fan of D3.js and now there is Observable Plot! I am building several data visualization software for exploratory data analysis:
RATH, auto exploratory data analysis: https://github.com/Kanaries/Rath
GraphicWalker, embeddable data exploration component: https://github.com/Kanaries/graphic-walker
They are using vega-lite for now. But there is a limit of building more fancy and customized visualizations. It seems Plot has a more flexible layer based visualization system that can support larger design space.
Is Plot stable enough now to migrate from vega-lite based system to Plot based? Are there any large milestone or roadmap of Plot in future?
-
Show HN: RATH – Open-Source Copilot and Autopilot for Data Analysis
+ Graphic Walker (https://github.com/Kanaries/graphic-walker): A lite embeddable component for visual analysis.
+ PyGWalker (https://github.com/Kanaries/pygwalker): turning your pandas dataframe into a Tableau-style User Interface for visual exploration.
RATH is a collection of interesting ideas that we think the next generation of data analysis software should be, so there might be many features that not well organized to be a united app. Tell me which feature you prefer and which is not. Looking forward for your ideas and advice.
-
Hey /r/SQL! I created a tool for data analysts to save time and visualize data using DuckDB - looking for feedback
I know you said you *dont* want a tableau like interface, but in case you do this might be a cool open source project to check out: https://github.com/Kanaries/graphic-walker
-
Show HN: Turn Your Pandas Dataframe to a Tableau-Style UI for Visual Analysis
> it seems like the heavy lifting is done by the web app here: https://github.com/Kanaries/graphic-walker
FWIW, both are made by the same entity, Kanaries.
-
Tools for Integrating Interactive Dashboards on a Website without Licensing Problems?
You may try to use and modify based on the OpenSource Graphic Walker: https://github.com/Kanaries/graphic-walker
- Easier Data Visualization & Exploration in React: Graphic Walker
-
Graphic Walker, A Different Type of Open Source Tableau Alternative
Graphic Walker is designed to be easy to embed in other applications as a React component. Check out the code here: https://github.com/Kanaries/graphic-walker
What are some alternatives?
web-llm - Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Rath - Next generation of automated data exploratory analysis and visualization platform.
mutable - A Database System for Research and Fast Prototyping
superset - Apache Superset is a Data Visualization and Data Exploration Platform
chdb - chDB is an embedded OLAP SQL Engine 🚀 powered by ClickHouse
streamlit - Streamlit — A faster way to build and share data apps.
ch32v003fun - An open source software development stack for the CH32V003 10¢ 48 MHz RISC-V Microcontroller - as well as many other chips within the ch32v/x line.
vega-embed - Publish Vega visualizations as embedded web components with interactive parameters.
bacalhau - Compute over Data framework for public, transparent, and optionally verifiable computation
pygwalker - PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
duckdb - DuckDB is an in-process SQL OLAP Database Management System
pivottable - Open-source Javascript Pivot Table (aka Pivot Grid, Pivot Chart, Cross-Tab) implementation with drag'n'drop.