Python MapReduce

Open-source Python projects categorized as MapReduce

Top 4 Python MapReduce Projects

  1. data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. mrjob

    Run MapReduce jobs on Hadoop or Amazon Web Services

  4. dumbo

    Python module that allows one to easily write and run Hadoop programs.

  5. tdigest

    t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark (by CamDavidsonPilon)

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python MapReduce discussion

Log in or Post with

Python MapReduce related posts

  • Intro to Ray on GKE

    3 projects | dev.to | 12 Sep 2024
  • 🦿🛴Smarcity garbage reporting automation w/ ollama

    6 projects | dev.to | 31 Jan 2024
  • Deequ for generating data quality reports

    3 projects | dev.to | 24 Nov 2022
  • Machine Learning Pipelines with Spark: Introductory Guide (Part 1)

    5 projects | dev.to | 23 Oct 2022
  • A peek into Location Data Science at Ola

    6 projects | dev.to | 26 Sep 2022
  • Best Open source no-code ELT tool for startup

    5 projects | /r/dataengineering | 29 Aug 2022
  • How to use Spark and Pandas to prepare big data

    3 projects | dev.to | 10 May 2022
  • A note from our sponsor - CodeRabbit
    coderabbit.ai | 25 Mar 2025
    Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more →

Index

What are some of the best open-source MapReduce projects in Python? This list will help you:

# Project Stars
1 data-science-ipython-notebooks 27,993
2 mrjob 2,618
3 dumbo 1,033
4 tdigest 390

Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Python is
the 2nd most popular programming language
based on number of references?