Ask HN: Best way to provide access to large data sets

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • datasette

    An open source multi-tool for exploring and publishing data

  • I've been researching methods for exposing large data sets to users of my website, and came across the Atlassian Data Lake (https://www.atlassian.com/platform/analytics/what-is-atlassian-data-lake), and was curious - is this a thing that companies are doing now?

    For personal projects and smaller scale non-dynamic data sets I've used Datasette (https://datasette.io/), but haven't seen anyone providing anything at scale beyond either an API for accessing data or some sort of periodic file export.

  • roapi

    Create full-fledged APIs for slowly moving datasets without writing a single line of code.

  • Smaller datasets would be anywhere up to a few MB, which is reasonable to serve with an API, but in theory historic data could run to several GB. I've not seen Datasette go that high (IIRC it has a 1000-row return limit by default).

    That's what got me intrigued by Atlassian's offering, as data lakes tend to be something internal to a company, not something I've ever seen offered as an interaction point for users.

    I've also tested out roapi [1], which is nice if the data is already in some structured format (Parquet/JSON).

    [1] https://github.com/roapi/roapi

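The per-request row limit mentioned above is usually worked around on the client side by paging: fetch one page, follow its link to the next, and repeat until there is none. Below is a minimal sketch of that loop in Python. It assumes each JSON page carries a `rows` list and a `next_url` field; those key names, the example URLs, and the `fake_fetch` helper are illustrative assumptions, so check them against the API and version you are actually using.

```python
from typing import Callable, Dict, Iterator, Optional


def iter_all_rows(fetch_page: Callable[[str], Dict], first_url: str) -> Iterator[dict]:
    """Yield every row by following each page's link to the next page."""
    url: Optional[str] = first_url
    while url:
        page = fetch_page(url)
        yield from page.get("rows", [])
        # Assumed key name; a missing or None value ends the loop.
        url = page.get("next_url")


# Stand-in for an HTTP call so the sketch runs offline (hypothetical URLs and data).
FAKE_PAGES = {
    "/data/readings.json": {
        "rows": [{"id": 1}, {"id": 2}],
        "next_url": "/data/readings.json?_next=2",
    },
    "/data/readings.json?_next=2": {
        "rows": [{"id": 3}],
        "next_url": None,
    },
}


def fake_fetch(url: str) -> Dict:
    """Return the canned page for a URL instead of hitting the network."""
    return FAKE_PAGES[url]


all_rows = list(iter_all_rows(fake_fetch, "/data/readings.json"))
```

Against a real server, `fetch_page` could be a small wrapper around `urllib.request.urlopen` and `json.load`. Because the function is a generator, rows are streamed page by page rather than held in memory all at once, which matters once the data reaches the multi-GB range discussed in the thread.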
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Zero code API generation using rust

    3 projects | /r/rust | 25 Feb 2021
  • Tuql: Automatically create a GraphQL server from a SQLite database

    6 projects | news.ycombinator.com | 25 Apr 2023
  • What do you guys think is the bare minimum for any of you to feel inclined to use graphql? And what would you say is the easiest way to implement it these days?

    3 projects | /r/graphql | 7 Jul 2022
  • CALISHOT 2021-08: Find ebooks among 403 Calibre sites

    2 projects | /r/opendirectories | 9 Aug 2021
  • This Week In Python

    5 projects | dev.to | 17 Mar 2024