Show HN: Stanchion – Column-oriented tables in SQLite

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

stanchion

3 618 8.9 C

A SQLite extension that brings column-oriented tables to SQLite

The "Data Storage Internals" section[1] of the README sounds to me like it has its own column-oriented format for these tables, at least that's how I'm reading the part about segments. Is that the case? If so, have you tried using Apache Arrow or Parquet to see how they compare?
[1] https://github.com/dgllghr/stanchion#data-storage-internals

ClickBench

71 571 9.0 HTML

ClickBench: a Benchmark For Analytical Databases

Interesting project! Thank you for open sourcing and sharing. Agree that local and embedded analytics are an increasing trend, I see it too.
A couple of questions:
* I’m curious what the difficulties were in the implementation. I suspect it is quite a challenge to implement this support in the current SQLite architecture, and would curious to know which parts were tricky and any design trade-off you were faced with.
* Aside from ease-of-use (install extension, no need for a separate analytical database system), I wonder if there are additional benefits users can anticipate resulting from a single system architecture vs running an embedded OLAP store like DuckDB or clickhouse-local / chdb side-by-side with SQLite? Do you anticipate performance or resource efficiency gains, for instance?
* I am also curious, what the main difficulty with bringing in a separate analytical database is, assuming it natively integrates with SQLite. I may be biased, but I doubt anything can approach the performance of native column-oriented systems, so I'm curious what the tipping point might be for using this extension vs using an embedded OLAP store in practice.
Btw, would love for you or someone in the community to benchmark Stanchion in ClickBench and submit results! (https://github.com/ClickHouse/ClickBench/)
Disclaimer: I work on ClickHouse.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Umbra: A Disk-Based System with In-Memory Performance [pdf]

3 projects | news.ycombinator.com | 2 May 2024
Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

2 projects | dev.to | 27 Mar 2024
ClickBench – A Benchmark for Analytical DBMS

1 project | news.ycombinator.com | 8 Feb 2024
Why Postgres RDS didn't work for us

4 projects | news.ycombinator.com | 3 Feb 2024
ClickBench: A Benchmark for Analytical Databases

1 project | news.ycombinator.com | 22 Jan 2024

Show HN: Stanchion – Column-oriented tables in SQLite

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Olap Analytics column-oriented Benchmark column-store
Post date: 31 Jan 2024

stanchion

ClickBench

InfluxDB

Related posts

Umbra: A Disk-Based System with In-Memory Performance [pdf]

Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

ClickBench – A Benchmark for Analytical DBMS

Why Postgres RDS didn't work for us

ClickBench: A Benchmark for Analytical Databases

Show HN: Stanchion – Column-oriented tables in SQLite

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Olap Analytics column-oriented Benchmark column-store Post date: 31 Jan 2024

stanchion

ClickBench

InfluxDB

Related posts

Umbra: A Disk-Based System with In-Memory Performance [pdf]

Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

ClickBench – A Benchmark for Analytical DBMS

Why Postgres RDS didn't work for us

ClickBench: A Benchmark for Analytical Databases

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Olap Analytics column-oriented Benchmark column-store
Post date: 31 Jan 2024