Building a new vector based storage model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • QuestDB

    An open source time-series database for fast ingest and SQL queries

  • Hey everyone! I'm Vlad, co-founder/CTO of QuestDB, an open-source database for time series data focusing on performance (https://github.com/questdb/questdb).

    We launched QuestDB last summer [1] and this was our first iteration. We also showed some of our querying capabilities by putting QuestDB on the internet with a 1.6 billion rows data set to query, live [2]! Our storage model is vector-based and append-only. This meant that all incoming data had to arrive in the correct time order. This worked well for some use cases but we quickly realised its limitation in "real-world" use cases, where data doesn't always land at the database in chronological order (we explain this in more details). In short, we were not ready to face the world! We were naive and had a lot to learn. We saw plenty of developers and users come and go specifically because of this technical limitation.

    After we decided to deal with out-of-order data, the big decision to make was which direction we would choose to tackle the problem. LSM trees seemed an obvious choice, but we chose an alternative route so we could keep the performance we spent years building. Our latest release supports out-of-order ingestion by re-ordering data on the fly, and we're happy to share our approach to solving this challenge.

    Additionally, we had many people asking about the differences between QuestDB and other open-source databases and why users should consider giving it a try instead of other systems. When we launched on HN, we saw even more interest in technical analyses, and readers showed a lot of interest in side-by-side comparisons to other databases on the market. One suggestion [3] that we thought would be great to try out was to benchmark ingestion and query speeds using the Time Series Benchmark Suite (TSBS) [4] developed by TimescaleDB. We're super excited to share the results in the article.

    I'd love to hear your feedback on this release and I'm happy to answer any questions you have, along with the rest of the QuestDB team.

    Vlad

    [1] https://news.ycombinator.com/item?id=23975807

  • tsbs

    Time Series Benchmark Suite, a tool for comparing and evaluating databases for time series data

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • How we achieved write speeds of 1.4 million rows per second

    3 projects | dev.to | 21 May 2021
  • How we achieved write speeds of 1.4 million rows per second

    3 projects | /r/programming | 12 May 2021
  • Show HN: QuestDB with Python, Pandas and SQL in a Jupyter notebook – no install

    3 projects | news.ycombinator.com | 21 Feb 2023
  • Normalizing Grafana charts with window functions

    1 project | dev.to | 10 Jan 2024
  • How to increase Grafana refresh rate frequency

    1 project | dev.to | 10 Jan 2024