Working with data files too large for RAM

This page summarizes the projects mentioned and recommended in the original post on /r/learnpython

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • h5py

    HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.

  • There's some good answers here, but another option I haven't seen suggested: Convert your txt file to HDF5 (Regardless if you follow my approach here, you should really consider converting your data to anything but a txt file). There's a nice library for working with it in python called h5py. The HDF format is designed specifically with working with very large sets of data (it even has compression options), often scientific in nature, but it's not a database. As far as how this fixes the specific issue you you've described, you can utilize numpy slicing to load one chunk your data at a time. Here's a stackoverflow answer which discusses a solution.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • How to Build a Logistic Regression Model: A Spam-filter Tutorial

    1 project | dev.to | 5 May 2024
  • Granary: The Social Web Translator

    1 project | news.ycombinator.com | 5 May 2024
  • Bitcoin Sentiment Analysis using Python and X (Formerly Twitter)

    1 project | dev.to | 5 May 2024
  • Zero Shot Text Classification Under the hood

    1 project | dev.to | 5 May 2024
  • Demystifying OS Concepts (Part 2): Other Synchronization Primitives

    1 project | dev.to | 5 May 2024