Shell Hadoop

Open-source Shell projects categorized as Hadoop

Top 3 Shell Hadoop Projects

  1. docker-hadoop

    Apache Hadoop docker image

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. winutils

    winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows (by cdarlint)

  4. NiFItoKafkaConnect

    NiFi -> Kafka Connect -> HDFS

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Shell Hadoop discussion

Log in or Post with

Shell Hadoop related posts

  • Navigating the Data Jungle. Data Analysis Software: A Comprehensive Guide

    2 projects | dev.to | 6 Jun 2024
  • Unable to write dataframe to files using PySpark on Pycharm

    1 project | /r/apachespark | 11 Dec 2023
  • Install Hadoop for Beginner

    1 project | /r/dataengineering | 7 Nov 2021
  • Free Spark dev environment on Local?

    2 projects | /r/dataengineering | 20 Aug 2021
  • Getting Started with the latest version of Apache Spark using Python and Scala in your local PC using Intellij , Windows, Mac , Linux Databricks and Apache Zeppelin.

    1 project | /r/Stream2Learn | 8 Jul 2021
  • Hadoop on M1 Mac?

    1 project | /r/dataengineering | 26 Apr 2021
  • An Overview of Lambda Architecture

    1 project | dev.to | 23 Feb 2021
  • A note from our sponsor - SaaSHub
    www.saashub.com | 14 May 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Hadoop projects in Shell? This list will help you:

# Project Stars
1 docker-hadoop 2,256
2 winutils 2,055
3 NiFItoKafkaConnect 3

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Shell is
the 11th most popular programming language
based on number of references?