PostgreSQL alternative for Large amounts of data

This page summarizes the projects mentioned and recommended in the original post on /r/PostgreSQL

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • timescaledb-parallel-copy

    A binary for parallel copying of CSV data into a TimescaleDB hypertable

  • As @dcalde mentioned before, apart from the timescaledb extension, there is parallel-copy (https://github.com/timescale/timescaledb-parallel-copy) which uses multiple threads / connections to bulk insert CSV files into postgresql. It works with timescale's hypertables, but also vanilla postgres tables.

  • datafusion

    Apache DataFusion SQL Query Engine

  • I've run datafusion over a collection of parquet files I had converted from csv. Was neat, but on a single VM it wasn't too hard with my dataset to do a group by query that blew through 64GiB ram.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • pgloader

    Migrate to PostgreSQL in a single command!

  • rows

    A common, beautiful interface to tabular data, no matter the format

  • I have used following python utility https://github.com/turicas/rows , it provides cli that can bulk load csv to postgres tables.

  • bellboy

    Highly performant JavaScript data stream ETL engine.

  • If you are doing processing on each row, I would suggest that take a look at this library https://github.com/Claviz/bellboy It’s leveraging node js streams and provides quite a few essentials required for such etl processing.

  • etlp

    A transducers based etl processing library written in clojure

  • You can take a look into code if you want to, but it’s heavily wip right now https://github.com/aregee/etlp

  • q

    q - Run SQL directly on delimited files and multi-file sqlite databases (by harelba)

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Advice on ETL and Data Sharing work process

    1 project | /r/ETL | 7 Nov 2023
  • Best way to introduce a linter?

    1 project | /r/golang | 30 Apr 2023
  • XlOil: The fastest library for writing Excel functions in Python

    2 projects | news.ycombinator.com | 3 Mar 2023
  • Segítség kérés Excel automatizáláshoz

    2 projects | /r/programmingHungary | 16 Feb 2023
  • GitHub - cunnane/xloil: xlOil provides framework for interacting with Excel in different programming languages (python & C++ currently)

    1 project | /r/Python | 3 Feb 2023