Recommendation for a Database for analysis

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

ta-lib-python

23 8,991 7.3 Cython

Python wrapper for TA-Lib (http://ta-lib.org/).

TA-Lib

bcolz

1 955 0.0 C

Discontinued A columnar data container that can be compressed.

What you need for your use case is a column-oriented store. I recommend explore bcolz or apache arrow for a column file-based systems. These are very fast, support memory mapping, uses compression and SSD speed (and even CPU architecture, in case of arrow) optimally almost out of the box, and has good interfaces to Numpy and Pandas (in case you are using Python for final data consumption and analysis). The columnar structure makes it easy to add or delete a column easily (or even dynamically). If you need a more scalable (albeit at the cost of speed) solution, you can devise a schema over a regular columnar db or an nosql db - see arctic from Man group for an example.

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Apache Arrow

75 13,480 10.0 C++

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

What you need for your use case is a column-oriented store. I recommend explore bcolz or apache arrow for a column file-based systems. These are very fast, support memory mapping, uses compression and SSD speed (and even CPU architecture, in case of arrow) optimally almost out of the box, and has good interfaces to Numpy and Pandas (in case you are using Python for final data consumption and analysis). The columnar structure makes it easy to add or delete a column easily (or even dynamically). If you need a more scalable (albeit at the cost of speed) solution, you can devise a schema over a regular columnar db or an nosql db - see arctic from Man group for an example.

arctic

52 3,028 7.1 Python

High performance datastore for time series and tick data

What you need for your use case is a column-oriented store. I recommend explore bcolz or apache arrow for a column file-based systems. These are very fast, support memory mapping, uses compression and SSD speed (and even CPU architecture, in case of arrow) optimally almost out of the box, and has good interfaces to Numpy and Pandas (in case you are using Python for final data consumption and analysis). The columnar structure makes it easy to add or delete a column easily (or even dynamically). If you need a more scalable (albeit at the cost of speed) solution, you can devise a schema over a regular columnar db or an nosql db - see arctic from Man group for an example.

trading-utils

8 109 9.0 Python

Collection of scripts and utilities for stock market analysis, strategies etc

I do the exact thing with a CSV file. The project is open source here https://github.com/namuan/trading-utils/ if you want to have a look.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Interacting with Amazon S3 using AWS Data Wrangler (awswrangler) SDK for Pandas: A Comprehensive Guide
5 projects | dev.to | 20 Aug 2023
How to use Spark and Pandas to prepare big data
3 projects | dev.to | 10 May 2022
How to use Spark and Pandas to prepare big data
3 projects | dev.to | 21 Sep 2021
Arrow v1.0: After 8 years, a new milestone with a lot of new features
3 projects | news.ycombinator.com | 26 Feb 2021
AutoCodeRover resolves 22% of real-world GitHub in SWE-bench lite
8 projects | news.ycombinator.com | 9 Apr 2024

Recommendation for a Database for analysis

This page summarizes the projects mentioned and recommended in the original post on /r/algotrading
Python Science and Data analysis ta-lib Arrow tickstore
Post date: 13 May 2021

ta-lib-python

bcolz

InfluxDB

Apache Arrow

arctic

trading-utils

WorkOS

Related posts

Recommendation for a Database for analysis

This page summarizes the projects mentioned and recommended in the original post on /r/algotrading Python Science and Data analysis ta-lib Arrow tickstore Post date: 13 May 2021

ta-lib-python

bcolz

InfluxDB

Apache Arrow

arctic

trading-utils

WorkOS

Related posts

This page summarizes the projects mentioned and recommended in the original post on /r/algotrading
Python Science and Data analysis ta-lib Arrow tickstore
Post date: 13 May 2021