The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 5 Python MapReduce Projects
-
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark (by CamDavidsonPilon)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Python MapReduce related posts
- 🦿🛴Smarcity garbage reporting automation w/ ollama
- Deequ for generating data quality reports
- Machine Learning Pipelines with Spark: Introductory Guide (Part 1)
- A peek into Location Data Science at Ola
- Best Open source no-code ELT tool for startup
- How to use Spark and Pandas to prepare big data
- How to use Spark and Pandas to prepare big data
-
A note from our sponsor - WorkOS
workos.com | 25 Apr 2024
Index
What are some of the best open-source MapReduce projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | data-science-ipython-notebooks | 26,459 |
2 | dpark | 2,691 |
3 | mrjob | 2,609 |
4 | dumbo | 1,034 |
5 | tdigest | 375 |
Sponsored