Similar projects and alternatives to Dask
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
NumPy aware dynamic Python compiler using LLVM
Network Analysis in Python
A Python framework for creating reproducible, maintainable and modular data science code.
Interactive Parallel Computing with IPython
IPython Parallel: Interactive Parallel Computing in Python
Statsmodels: statistical modeling and econometrics in Python
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
NumPy and Pandas interface to Big Data
Bayesian Modeling in Python
Scrapy, a fast high-level web crawling & scraping framework for Python.
Apache Spark - A unified analytics engine for large-scale data processing
ClickHouse® is a free analytics DBMS for big data
💫 Industrial-strength Natural Language Processing (NLP) in Python
Python packaging and dependency management made easy
The fundamental package for scientific computing with Python.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Nim is a statically typed compiled systems programming language. It combines successful concepts from mature languages like Python, Ada and Modula. Its design focuses on efficiency, expressiveness, and elegance (in that order of priority).
Data validation using Python type hints
Dask reviews and mentions
A peek into Location Data Science at Ola
6 projects | dev.to | 26 Sep 2022
Data scientists work on phenomenally large datasets, and Dask is a handy tool for exploration within the confines of a single cloud VM or their local PCs. Location data visualization is an essential part of deciding further algorithm development and roadmap for projects. This lays the foundation for data engineering and science to work at scale, with petabytes of data.
File format for large data with many columns
2 projects | reddit.com/r/Python | 15 May 2022
What is the best way to save a csv.file in number only ? PC hangs when my file is more than 2GB
2 projects | reddit.com/r/learnpython | 4 Apr 2022
Large Scale Hydrology: Geocomputational tools that you use
3 projects | reddit.com/r/Hydrology | 13 Feb 2022
We're using a lot of Python. In addition to these, gridMET, Dask, HoloViz, and kerchunk.
msgspec - a fast & friendly JSON/MessagePack library
4 projects | reddit.com/r/Python | 10 Feb 2022
I wrote this for speeding up the RPC messaging in dask, but figured it might be useful for others as well. The source is available on github here: https://github.com/jcrist/msgspec.
What does it mean to scale your python powered pipeline?
4 projects | dev.to | 3 Jan 2022
Dask: Distributed data frames, machine learning and more
Data pipelines with Luigi
4 projects | dev.to | 22 Dec 2021
To do that, we use Dask efficiently by creating on-demand local (or remote) clusters in each task's run() method.
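The code from the quoted post is not reproduced on this page. As a rough illustration of the pattern it describes, the sketch below starts a small in-process Dask cluster inside a task's run() method and shuts it down when the work is done. The class SumSquaresTask and its attributes are hypothetical stand-ins (a real Luigi pipeline would subclass luigi.Task), and the cluster sizing is an arbitrary assumption.

```python
from dask.distributed import Client, LocalCluster


class SumSquaresTask:
    """Hypothetical stand-in for a luigi.Task subclass; only run() is sketched."""

    def __init__(self, n):
        self.n = n
        self.result = None

    def run(self):
        # Create the cluster on demand and tear it down when the task finishes.
        # processes=False keeps workers in this process; sizing is an assumption.
        with LocalCluster(
            n_workers=1,
            threads_per_worker=2,
            processes=False,
            dashboard_address=None,  # disable the diagnostics dashboard
        ) as cluster:
            with Client(cluster) as client:
                # Fan the work out to the cluster, then gather the results.
                futures = client.map(lambda x: x * x, range(self.n))
                self.result = sum(client.gather(futures))


task = SumSquaresTask(5)
task.run()
print(task.result)
```

For the remote case mentioned in the excerpt, Client can instead be given the address of an already running scheduler; that address is deployment-specific and is not shown here.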
Dask – a flexible library for parallel computing in Python
8 projects | news.ycombinator.com | 17 Nov 2021
Distributed computing in python??
2 projects | reddit.com/r/learnpython | 9 Nov 2021
Show HN: Hamilton, a Microframework for Creating Dataframes
6 projects | news.ycombinator.com | 8 Nov 2021
This project reminds me a lot of Dask https://dask.org/. A library that allows delayed calculation of complex dataframes in Python.
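To make the "delayed calculation" mentioned in this comment concrete, here is a minimal sketch using dask.delayed. It uses plain Python lists rather than dataframes to stay dependency-light, and the function names load, total, and combine are illustrative assumptions, not part of Dask or Hamilton.

```python
from dask import delayed


@delayed
def load(n):
    # Stand-in for reading one chunk of a larger dataset.
    return list(range(n))


@delayed
def total(chunk):
    # Per-chunk reduction.
    return sum(chunk)


@delayed
def combine(parts):
    # Final reduction over the per-chunk results.
    return sum(parts)


# Calling the decorated functions only builds a task graph; nothing runs yet.
chunks = [load(n) for n in (3, 4, 5)]
graph = combine([total(c) for c in chunks])

# compute() executes the graph; the synchronous scheduler runs it in this thread.
result = graph.compute(scheduler="synchronous")
print(result)
```

The same graph can be handed to a threaded or distributed scheduler without changing the functions, which is the property the comment is pointing at.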
dask/dask is an open-source project licensed under the BSD 3-Clause "New" or "Revised" License, which is an OSI-approved license.