apache logo

Apache Hadoop

Apache Hadoop (by apache)

Stats

Basic Apache Hadoop repo stats
1
11,503
9.7
4 days ago

apache/hadoop is an open source project licensed under Apache License 2.0 which is an OSI approved license.

Apache Hadoop Alternatives

Similar projects and alternatives to Apache Hadoop based on common topics and language

  • GitHub repo Deeplearning4j

    Model import deployment framework for retraining models (pytorch, tensorflow,keras) deploying in JVM Micro service environments, mobile devices, iot, and Apache Spark

  • GitHub repo Presto

    The official home of the Presto distributed SQL query engine for big data

  • GitHub repo Alluxio (formerly Tachyon)

    Alluxio, data orchestration for analytics and machine learning in the cloud

  • GitHub repo Apache Ignite

    Apache Ignite (by apache)

  • GitHub repo Apache Hive

    Apache Hive

  • GitHub repo Trino

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

  • GitHub repo presto

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) [Moved to: https://github.com/trinodb/trino] (by prestosql)

NOTE: The number of mentions on this list indicates mentions on common posts. Hence, a higher number means a better Apache Hadoop alternative or higher similarity.

Posts

Posts where Apache Hadoop has been mentioned. We have used some of these posts to build our list of alternatives and similar projects.
  • Currently in Data Science. Should I make the move?
    It'd be best to clarify exactly what we mean by "Hadoop", but if we define it as the suite described here then the only components I still see being used for greenfield are HDFS - or, to be more specific, HDFS-compatible filesystems (AWS EMR and Azure Data Lake Storage both offer HDFS compatibility) - and maybe (Spark) YARN.