Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark. (by USCDataScience)


USCDataScience/sparkler is an open source project licensed under the Apache License 2.0, which is an OSI-approved license.

Sparkler Alternatives

Similar projects and alternatives to Sparkler based on common topics and language

  • Apache Solr

    Apache Lucene and Solr open-source search software

  • Apache Nutch

    Apache Nutch is an extensible and scalable web crawler

  • LuceneBench

    Lucene Benchmark: benchmarking Lucene vs. SeekStorm

  • storm-crawler

    A scalable, mature and versatile web crawler based on Apache Storm

  • Elasticsearch

    Free and Open, Distributed, RESTful Search Engine

  • OpenSearch

    Open source distributed and RESTful search engine. (by opensearch-project)

  • Zeppelin

    Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

NOTE: The number of mentions for a project on this list counts the posts that mention it together with Sparkler. A higher number therefore suggests a closer Sparkler alternative or greater similarity.


Posts where Sparkler has been mentioned. We have used some of these posts to build our list of alternatives and similar projects.

We don't know of any posts mentioning Sparkler yet. We started tracking mentions in Dec 2020.