Dask

Open-source projects categorized as Dask

Top 23 Dask Open-Source Projects

  • Dask

    Parallel computing with task scheduling

    Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • cudf

    cuDF - GPU DataFrame Library

    Project mention: A Polars exploration into Kedro | dev.to | 2023-05-17

    The interesting thing about Polars is that it does not try to be a drop-in replacement to pandas, like Dask, cuDF, or Modin, and instead has its own expressive API. Despite being a young project, it quickly got popular thanks to its easy installation process and its “lightning fast” performance.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • ibis

    the portable Python dataframe library

    Project mention: This Week In Python | dev.to | 2024-03-17

    ibis – portable Python dataframe library

  • xarray

    N-D labeled arrays and datasets in Python

  • stumpy

    STUMPY is a powerful and scalable Python library for modern time series analysis

    Project mention: Stumpy: Matrix profile time series analysis | news.ycombinator.com | 2024-03-03
  • mars

    Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

  • swifter

    A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner (by jmcarpenter2)

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • fugue

    A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

    Project mention: FLaNK Stack Weekly 22 January 2024 | dev.to | 2024-01-22
  • distributed

    A distributed task scheduler for Dask

  • Optimus

    :truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)

  • Eliot

    Eliot: the logging system that tells you *why* it happened

  • mlforecast

    Scalable machine 🤖 learning for time series forecasting.

    Project mention: Sales forecast for next two years | /r/datascience | 2023-06-25

    MLForecast

  • pystore

    Fast data store for Pandas time-series data

  • dask-sql

    Distributed SQL Engine in Python using Dask

    Project mention: FLaNK Stack Weekly for 20 June 2023 | dev.to | 2023-06-20
  • nebari

    🪴 Nebari - your open source data science platform (by nebari-dev)

  • amazon-sagemaker-local-mode

    Amazon SageMaker Local Mode Examples

    Project mention: Debugging Python Code in Amazon SageMaker Locally Using Visual Studio Code and PyCharm: A Step-by-Step Guide | dev.to | 2023-11-15

    git clone https://github.com/aws-samples/amazon-sagemaker-local-mode/ cd amazon-sagemaker-local-mode/general_pipeline_local_debug python3 -m venv .venv source .venv/bin/activate pip install jupyter jupyter lab

  • stackstac

    Turn a STAC catalog into a dask-based xarray

  • aicsimageio

    Image Reading, Metadata Conversion, and Image Writing for Microscopy Images in Python

  • orochi

    The Volatility Collaborative GUI

  • xgboost_ray

    Distributed XGBoost on Ray

  • bytehub

    ByteHub: making feature stores simple

  • dask-awkward

    Native Dask collection for awkward arrays, and the library to use it.

  • coiled-resources

    Notebooks that support blog posts and tech talks on Dask / Coiled.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-03-17.

Dask related posts

Index

What are some of the best open-source Dask projects? This list will help you:

Project Stars
1 Dask 11,965
2 cudf 7,257
3 ibis 4,041
4 xarray 3,399
5 stumpy 2,984
6 mars 2,673
7 swifter 2,456
8 fugue 1,869
9 distributed 1,539
10 Optimus 1,441
11 Eliot 1,081
12 mlforecast 707
13 pystore 527
14 dask-sql 363
15 nebari 255
16 amazon-sagemaker-local-mode 227
17 stackstac 222
18 aicsimageio 191
19 orochi 189
20 xgboost_ray 131
21 bytehub 57
22 dask-awkward 56
23 coiled-resources 40
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com