Webcrawler

Open-source projects categorized as Webcrawler

Top 10 Webcrawler Open-Source Projects

  • crawlab

    Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架

  • apollo

    A Unix-style personal search engine and web crawler for your digital footprint. (by amirgamil)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • scrapyrt

    HTTP API for Scrapy spiders

  • SpiderSuite

    Advance web security spider/crawler

  • Project mention: Show HN: SpiderSuite version 1.0.4. Whats new? | news.ycombinator.com | 2023-09-17
  • Rcrawler

    An R web crawler and scraper

  • krawler

    A web crawling framework written in Kotlin

  • wbot

    A simple & efficient web crawler.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • Abosar

    অবসর 📚 A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.

  • Spidey

    A multi threaded web crawler library that is generic enough to allow different engines to be swapped in.

  • Edupedu_crawler

    Un mic crawler in python care preia ultimele informatii despre profesori si greva de pe Edupedu si le comprima cu ajutorul lui BingAI direct in README

  • Project mention: Am facut un "pseudo-live-feed" al grevei | /r/Romania | 2023-05-28

    Link repo: https://github.com/Mike4544/Edupedu_crawler

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Webcrawler related posts

  • Self-hosted web scraper?

    4 projects | /r/selfhosted | 3 Jan 2023
  • I need data from a website. It is viable to create an API that scrapes the website and returns the data on an endpoint?

    2 projects | /r/dotnet | 20 Dec 2022
  • Can R do recursive web crawling?

    1 project | /r/RStudio | 22 Dec 2022
  • CI/CD in Action: Manage auto builds of large open-source projects with GitHub Actions?

    4 projects | dev.to | 20 Oct 2022
  • CI/CD in Action: How to use Microsoft's GitHub Actions in a right way?

    2 projects | dev.to | 13 Oct 2022
  • A small web crawler - WBot

    2 projects | /r/golang | 14 May 2022
  • GitHub - amirgamil/apollo: A Unix-style personal search engine and web crawler for your digital footprint.

    1 project | /r/devopsish | 28 Jul 2021
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 20 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Webcrawler projects? This list will help you:

Project Stars
1 crawlab 10,872
2 apollo 1,360
3 scrapyrt 817
4 SpiderSuite 543
5 Rcrawler 344
6 krawler 130
7 wbot 17
8 Abosar 12
9 Spidey 11
10 Edupedu_crawler 2

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com