A scalable web crawler framework for Java. (by code4craft)


Basic webmagic repo stats
15 days ago

code4craft/webmagic is an open source project licensed under Apache License 2.0 which is an OSI approved license.

Webmagic Alternatives

Similar projects and alternatives to webmagic based on common topics and language

  • GitHub repo google-search-results-java

    Google Search Results JAVA API via SerpApi

  • GitHub repo ServiceTalk

    A networking framework that evolves with your application

  • GitHub repo ActiveJ

    ActiveJ is an alternative Java platform built from the ground up. ActiveJ redefines web, high load, and cloud programming in Java, featuring ultimate performance and scalability!

  • GitHub repo Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  • GitHub repo jsoup

    jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.

  • GitHub repo Apache Nutch

    Apache Nutch is an extensible and scalable web crawler

  • GitHub repo storm-crawler

    A scalable, mature and versatile web crawler based on Apache Storm

NOTE: The number of mentions on this list indicates mentions on common posts. Hence, a higher number means a better webmagic alternative or higher similarity.


Posts where webmagic has been mentioned. We have used some of these posts to build our list of alternatives and similar projects.

We don't know posts mentioning webmagic yet. We started tracking mentions in Dec 2020.