MapReduce

Open-source projects categorized as MapReduce

Top 19 MapReduce Open-Source Projects

  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Project mention: "xAI will open source Grok" | news.ycombinator.com | 2024-03-11
  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • Redisson

    Redisson - Easy Redis Java client with features of In-Memory Data Grid. Sync/Async/RxJava/Reactive API. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache ...

  • PowerJob

    Enterprise job scheduling middleware with distributed computing ability.

  • dpark

    Python clone of Spark, a MapReduce alike framework in Python

  • mrjob

    Run MapReduce jobs on Hadoop or Amazon Web Services

  • dumbo

    Python module that allows one to easily write and run Hadoop programs.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • Mobius: C# API for Spark

    C# and F# language binding and extensions to Apache Spark (by microsoft)

  • tdigest

    t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark (by CamDavidsonPilon)

  • incubator-uniffle

    Uniffle is a high performance, general purpose Remote Shuffle Service.

    Project mention: Apache Uniffle: high performance, general purpose remote shuffle service | news.ycombinator.com | 2024-03-19
  • MapReduce

    An easy-to-use Map Reduce Go parallel-computing framework inspired by 2021 6.824 lab1. It supports multiple workers threads on a single machine and multiple processes on a single machine right now.

  • mapreduce

    A in-process MapReduce library to help you optimizing service response time or concurrent task processing. (by kevwan)

  • mit-6.824-distributed-systems

    Template repository to work on the labs from MIT 6.824 Distributed Systems course.

  • sonic-distribute

    Accelerate your distributed processes with this MapReduce framework. Focus on your logic and deploy tasks to workers seamelssly.

  • goterator

    Lazy iterator implementation for Golang

  • go-strm

    A rich Map/Reduce API in Go

  • slice

    Elixir's Enum module implemented in Go using generics. (by nwjlyons)

  • elasticsearch-sheets

    An experimental Google Sheets add-on to view and interact with Elasticsearch indices

  • Meduce

    MapReduce library for concurrent data processing

    Project mention: My MapReduce library | /r/golang | 2023-04-30

    https://github.com/djordje200179/Meduce https://pkg.go.dev/github.com/djordje200179/meduce

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-03-19.

MapReduce related posts

Index

What are some of the best open-source MapReduce projects? This list will help you:

Project Stars
1 Apache Spark 38,104
2 data-science-ipython-notebooks 26,307
3 Redisson 22,566
4 PowerJob 6,401
5 dpark 2,691
6 mrjob 2,609
7 dumbo 1,040
8 Mobius: C# API for Spark 941
9 tdigest 374
10 incubator-uniffle 348
11 MapReduce 210
12 mapreduce 148
13 mit-6.824-distributed-systems 53
14 sonic-distribute 30
15 goterator 16
16 go-strm 13
17 slice 9
18 elasticsearch-sheets 8
19 Meduce 3
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com