Show HN: Stanchion – Column-oriented tables in SQLite

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • stanchion

    A SQLite extension that brings column-oriented tables to SQLite

  • The "Data Storage Internals" section[1] of the README sounds to me like it has its own column-oriented format for these tables, at least that's how I'm reading the part about segments. Is that the case? If so, have you tried using Apache Arrow or Parquet to see how they compare?

    [1] https://github.com/dgllghr/stanchion#data-storage-internals

  • ClickBench

    ClickBench: a Benchmark For Analytical Databases

  • Interesting project! Thank you for open sourcing and sharing. Agree that local and embedded analytics are an increasing trend, I see it too.

    A couple of questions:

    * I’m curious what the difficulties were in the implementation. I suspect it is quite a challenge to implement this support in the current SQLite architecture, and would curious to know which parts were tricky and any design trade-off you were faced with.

    * Aside from ease-of-use (install extension, no need for a separate analytical database system), I wonder if there are additional benefits users can anticipate resulting from a single system architecture vs running an embedded OLAP store like DuckDB or clickhouse-local / chdb side-by-side with SQLite? Do you anticipate performance or resource efficiency gains, for instance?

    * I am also curious, what the main difficulty with bringing in a separate analytical database is, assuming it natively integrates with SQLite. I may be biased, but I doubt anything can approach the performance of native column-oriented systems, so I'm curious what the tipping point might be for using this extension vs using an embedded OLAP store in practice.

    Btw, would love for you or someone in the community to benchmark Stanchion in ClickBench and submit results! (https://github.com/ClickHouse/ClickBench/)

    Disclaimer: I work on ClickHouse.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Umbra: A Disk-Based System with In-Memory Performance [pdf]

    3 projects | news.ycombinator.com | 2 May 2024
  • Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

    2 projects | dev.to | 27 Mar 2024
  • ClickBench – A Benchmark for Analytical DBMS

    1 project | news.ycombinator.com | 8 Feb 2024
  • Why Postgres RDS didn't work for us

    4 projects | news.ycombinator.com | 3 Feb 2024
  • ClickBench: A Benchmark for Analytical Databases

    1 project | news.ycombinator.com | 22 Jan 2024