Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more →
Top 4 Python MapReduce Projects
-
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
-
-
tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark (by CamDavidsonPilon)
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
Python MapReduce discussion
Python MapReduce related posts
-
Intro to Ray on GKE
-
🦿🛴Smarcity garbage reporting automation w/ ollama
-
Deequ for generating data quality reports
-
Machine Learning Pipelines with Spark: Introductory Guide (Part 1)
-
A peek into Location Data Science at Ola
-
Best Open source no-code ELT tool for startup
-
How to use Spark and Pandas to prepare big data
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 25 Mar 2025
Index
What are some of the best open-source MapReduce projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | data-science-ipython-notebooks | 27,993 |
2 | mrjob | 2,618 |
3 | dumbo | 1,033 |
4 | tdigest | 390 |