graphic-walker
parquet-format
graphic-walker | parquet-format | |
---|---|---|
20 | 4 | |
2,243 | 1,645 | |
2.1% | 1.7% | |
9.4 | 7.2 | |
8 days ago | 9 days ago | |
TypeScript | Thrift | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
graphic-walker
-
Show HN: Open-source, browser-local data exploration using DuckDB-WASM and PRQL
[2] https://github.com/Kanaries/graphic-walker/issues/330
-
Show HN: GPT and tableau-style interface in R for data visualization
GWalkR is an open-source R library that allows you to turn your data frame into a tableau style user interface for data exploration and visualization. It also allows you to analysis your data with natural language questions.
GWalkR is the R binding of graphic-walker: https://github.com/Kanaries/graphic-walker
- FLaNK Stack for 4th of July
-
Observable Plot: The JavaScript library for exploratory data visualization
Big fan of D3.js and now there is Observable Plot! I am building several data visualization software for exploratory data analysis:
RATH, auto exploratory data analysis: https://github.com/Kanaries/Rath
GraphicWalker, embeddable data exploration component: https://github.com/Kanaries/graphic-walker
They are using vega-lite for now. But there is a limit of building more fancy and customized visualizations. It seems Plot has a more flexible layer based visualization system that can support larger design space.
Is Plot stable enough now to migrate from vega-lite based system to Plot based? Are there any large milestone or roadmap of Plot in future?
-
Show HN: RATH – Open-Source Copilot and Autopilot for Data Analysis
+ Graphic Walker (https://github.com/Kanaries/graphic-walker): A lite embeddable component for visual analysis.
+ PyGWalker (https://github.com/Kanaries/pygwalker): turning your pandas dataframe into a Tableau-style User Interface for visual exploration.
RATH is a collection of interesting ideas that we think the next generation of data analysis software should be, so there might be many features that not well organized to be a united app. Tell me which feature you prefer and which is not. Looking forward for your ideas and advice.
-
Hey /r/SQL! I created a tool for data analysts to save time and visualize data using DuckDB - looking for feedback
I know you said you *dont* want a tableau like interface, but in case you do this might be a cool open source project to check out: https://github.com/Kanaries/graphic-walker
-
Show HN: Turn Your Pandas Dataframe to a Tableau-Style UI for Visual Analysis
> it seems like the heavy lifting is done by the web app here: https://github.com/Kanaries/graphic-walker
FWIW, both are made by the same entity, Kanaries.
-
Tools for Integrating Interactive Dashboards on a Website without Licensing Problems?
You may try to use and modify based on the OpenSource Graphic Walker: https://github.com/Kanaries/graphic-walker
- Easier Data Visualization & Exploration in React: Graphic Walker
-
Graphic Walker, A Different Type of Open Source Tableau Alternative
Graphic Walker is designed to be easy to embed in other applications as a React component. Check out the code here: https://github.com/Kanaries/graphic-walker
parquet-format
-
Summing columns in remote Parquet files using DuckDB
Right, there's all sorts of metadata and often stats included in any parquet file: https://github.com/apache/parquet-format#file-format
The offsets of said metadata are well-defined (i.e. in the footer) so for S3 / blob storage so long as you can efficiently request a range of bytes you can pull the metadata without having to read all the data.
- FLaNK Stack for 4th of July
-
I have question related to Parquet files and AWS Glue
As i read here https://github.com/apache/parquet-format/blob/master/LogicalTypes.md , they are store in Integer formats and these integers represent the number of days (for Date) or number of milliseconds, microseconds or nanoseconds (for DateTime) since 1970-01-01. This works as expected with the parquet file that written by our ETL tool from internal database --> S3, all Data/DateTime columns are Integers, means that in Glue Job, i have to convert these Integers back to Date/Datetime value to do some transformation on them. But when parquet files are written by Spark, they are Date/DateTime (or TimeStamp to be more concise) format not Integers (i checked by read these files again in other Glue Job) and that make me confused.
-
Parquet: More than just “Turbo CSV”
Date is confusing with a timezone (UTC or otherwise) and the doco makes no such suggestion.
The Parquet datatypes documentation is pretty clear that there is a flag isAdjustedToUTC to define if the timestamp should be interpreted as having Instant semantics or Local semantics.
https://github.com/apache/parquet-format/blob/master/Logical...
Still no option to include a TZ offset in the data (so the same datum can be interpreted with both Local and Instant semantics) but not bad really.
What are some alternatives?
Rath - Next generation of automated data exploratory analysis and visualization platform.
rapidgzip - Gzip Decompression and Random Access for Modern Multi-Core Machines
superset - Apache Superset is a Data Visualization and Data Exploration Platform
xgen - Salesforce open-source LLMs with 8k sequence length.
streamlit - Streamlit — A faster way to build and share data apps.
wizmap - Explore and interpret large embeddings in your browser with interactive visualization! 📍
vega-embed - Publish Vega visualizations as embedded web components with interactive parameters.
FastSAM - Fast Segment Anything
pygwalker - PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
background-removal-js - Remove backgrounds from images directly in the browser environment with ease and no additional costs or privacy concerns. Explore an interactive demo.
pivottable - Open-source Javascript Pivot Table (aka Pivot Grid, Pivot Chart, Cross-Tab) implementation with drag'n'drop.
mdBook - Create book from markdown files. Like Gitbook but implemented in Rust