Top 11 pydata Open-Source Projects
-
pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
-
python-graphblas
Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics
The interesting thing about Polars is that it does not try to be a drop-in replacement for pandas, as Dask, cuDF, and Modin do, and instead has its own expressive API. Despite being a young project, it quickly became popular thanks to its easy installation process and its “lightning fast” performance.
pydata related posts
-
Shuffling large data at constant memory in Dask
-
My new company uses Pyspark. I want to learn it before my starting date. Any advice?
-
Great forward progress on squashing cluster deadlocks
-
Is Numpy always more efficient than Pandas? And how much should we rely on Python anyway?
-
Ask HN: Is PySpark a Dead-End?
-
How to load 85.6 GB of XML data into a dataframe
-
Index
What are some of the best open-source pydata projects? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | Dask | 11,999 |
2 | cudf | 7,291 |
3 | koalas | 3,319 |
4 | stumpy | 2,994 |
5 | pandas-datareader | 2,821 |
6 | distributed | 1,541 |
7 | pyjanitor | 1,284 |
8 | pyvtreat | 114 |
9 | python-graphblas | 112 |
10 | graphblas-algorithms | 62 |
11 | sgkit | 0 |