The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 19 MapReduce Open-Source Projects
-
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Redisson
Redisson - Easy Redis Java client and Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache ...
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark (by CamDavidsonPilon)
-
MapReduce
An easy-to-use Map Reduce Go parallel-computing framework inspired by 2021 6.824 lab1. It supports multiple workers threads on a single machine and multiple processes on a single machine right now.
-
mapreduce
A in-process MapReduce library to help you optimizing service response time or concurrent task processing. (by kevwan)
-
mit-6.824-distributed-systems
Template repository to work on the labs from MIT 6.824 Distributed Systems course.
-
sonic-distribute
Accelerate your distributed processes with this MapReduce framework. Focus on your logic and deploy tasks to workers seamelssly.
-
elasticsearch-sheets
An experimental Google Sheets add-on to view and interact with Elasticsearch indices
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Apache Uniffle: high performance, general purpose remote shuffle service | news.ycombinator.com | 2024-03-19
https://github.com/djordje200179/Meduce https://pkg.go.dev/github.com/djordje200179/meduce
MapReduce related posts
- Apache Uniffle: high performance, general purpose remote shuffle service
- "xAI will open source Grok"
- Groovy 🎷 Cheat Sheet - 01 Say "Hello" from Groovy
- 🦿🛴Smarcity garbage reporting automation w/ ollama
- Go concurrency simplified. Part 4: Post office as a data pipeline
-
Apache Spark VS quix-streams - a user suggested alternative
2 projects | 7 Dec 2023
- Apache Uniffle: a high performance remote shuffle service for Spark
-
A note from our sponsor - WorkOS
workos.com | 28 Apr 2024
Index
What are some of the best open-source MapReduce projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Apache Spark | 38,378 |
2 | data-science-ipython-notebooks | 26,459 |
3 | Redisson | 22,706 |
4 | PowerJob | 6,457 |
5 | dpark | 2,691 |
6 | mrjob | 2,609 |
7 | dumbo | 1,034 |
8 | Mobius: C# API for Spark | 937 |
9 | tdigest | 375 |
10 | incubator-uniffle | 354 |
11 | MapReduce | 212 |
12 | mapreduce | 159 |
13 | mit-6.824-distributed-systems | 53 |
14 | sonic-distribute | 30 |
15 | goterator | 16 |
16 | go-strm | 13 |
17 | slice | 9 |
18 | elasticsearch-sheets | 8 |
19 | Meduce | 3 |
Sponsored