Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Python Dask Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
-
swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner (by jmcarpenter2)
-
fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Show HN: Hashquery, a Python library for defining reusable analysis | news.ycombinator.com | 2024-04-23I really don't understand the appeal of dbt vs a proper programming language. The templating approach leads to massive spaghetti. I look forward to trying out something like Ibis [0]
0: https://ibis-project.org/
MLForecast
Project mention: Debugging Python Code in Amazon SageMaker Locally Using Visual Studio Code and PyCharm: A Step-by-Step Guide | dev.to | 2023-11-15git clone https://github.com/aws-samples/amazon-sagemaker-local-mode/ cd amazon-sagemaker-local-mode/general_pipeline_local_debug python3 -m venv .venv source .venv/bin/activate pip install jupyter jupyter lab
Python Dask related posts
- Stumpy: Matrix profile time series analysis
- Shuffling large data at constant memory in Dask
- Fugue: A unified interface for distributed computing
- [Discussion] Open Source beats Google's AutoML for Time series
- File format for large data with many columns
- Time Series Analysis for air pollution data not aligned [R] [P]
- What is the best way to save a csv.file in number only ? PC hangs when my file is more than 2GB
-
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024
Index
What are some of the best open-source Dask projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | Dask | 11,982 |
2 | ibis | 4,074 |
3 | xarray | 3,404 |
4 | stumpy | 2,984 |
5 | mars | 2,677 |
6 | swifter | 2,459 |
7 | fugue | 1,876 |
8 | distributed | 1,541 |
9 | Optimus | 1,446 |
10 | Eliot | 1,083 |
11 | mlforecast | 713 |
12 | pystore | 539 |
13 | dask-sql | 363 |
14 | nebari | 256 |
15 | amazon-sagemaker-local-mode | 228 |
16 | stackstac | 222 |
17 | aicsimageio | 192 |
18 | xgboost_ray | 131 |
19 | bytehub | 57 |
20 | dask-awkward | 56 |
21 | dask-memusage | 24 |
22 | steam-data-engineering | 20 |
23 | pangeo-binder | 18 |
Sponsored