Python webscraper

Open-source Python projects categorized as webscraper

Top 21 Python webscraper Projects

  • rightmove_webscraper.py

    Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • Stocker

    Financial Web Scraper & Sentiment Classifier (by dwallach1)

  • iSubRip

    A Python command-line tool for scraping and downloading subtitles from AppleTV and iTunes movie pages.

  • CoWin-Vaccine-Notifier

    Automated Python Script to retrieve vaccine slots availability and get notified when a slot is available.

  • Jobs_LinkedIn

    Finds Jobs on LinkedIn using web-scraping

  • SearchifyX

    Fast flashcard searcher study tool

  • scraperx

    Library for scraping websites or apis at any scale

  • InfluxDB

    Purpose built for real-time analytics at any scale. InfluxDB Platform is powered by columnar analytics, optimized for cost-efficient storage, and built with open data standards.

    InfluxDB logo
  • letterboxdpy

    A letterboxd webscraper

  • bedrock-agents-webscraper

    This repo provides guidance on setting up a bedrock agent to webscrape and internet search via action groups

    Project mention: Adding Web Scraping and Google Search to AWS Bedrock Agents | dev.to | 2024-07-17

    There is a web scraper example from AWS that covers this, but I wanted to make a version for NodeJS in TypeScript. I also wasn't happy with the Google search capability relying on web scraping, so I swapped it out for the Google custom search API. My solution will also be making use of the AWS CLI and Docker images to make things more consistent.

  • otakuapuri

    Otakuapuri is a manga downloader and anime streaming application that provides an easy and convenient platform for manga and anime enthusiasts. Users can download their favorite manga in PDF format and stream their favorite anime series.

  • kicktipp-bot

    A bot which can submit tips for a Kicktipp competition based on quotes.

  • raspberry-pi-stock-checker

    A configurable python webscraper that checks raspberry pi stocks from verified sellers

  • web-scraper

    This project is a Flask-based web application designed to scrape various types of content from a specified URL. (by mandarwagh9)

    Project mention: Show HN: Scrape any website with just url, with WebUI or Terminal | news.ycombinator.com | 2024-08-21
  • YellowPage-scraper

    A YellowPage scraper is a Python program/script that extracts data from the YellowPages.com website using the Python programming language. The scraper can be used to gather information such as business names, addresses, phone numbers, emails and reviews from the YellowPages website.

    Project mention: Private business directory website | /r/selfhosted | 2023-12-03

    Hi. I am looking to host a private business directory for an community of entrepreneurs, similar to www.yellowpages.com. Private as in protected by a pin or something. Got any suggestions?

  • ti_scraper

    Highly configurable scripts for a web scraper intended to be used for cyber threat intelligence

    Project mention: Adding Proxy to existing Scraper | /r/webscraping | 2023-11-04

    because I'm not a developer, I took this project https://github.com/sandra-liedtke/ti_scraper to help me.

  • PotParser

    Python package which allows you to scrape information about cannabis strains and calculate the amount of THC or CBD in a given amount of flower

  • jobsearch-python-webcrawler

    Automatically scrap all the job titles and links, then organize into an excel file. Using Selenium and Openpyxl to scrape the information.

  • webcrawler

    This repository contains Python code for web crawling. It is built using the BeautifulSoup library and allows you to extract text from web pages and store it in text files. The crawler can also extract hyperlinks from web pages and crawl them recursively.This code will be a great starting point for your own web scraping projects

  • CL-Checker

    An automated Craigslist webscrapper, written in Python for Windows.

  • udemy-price-tracker

    A Udemy web scraper that scrapes prices.

  • poolbooru_gelscraper

    a simple python script for scraping images off gelbooru pools.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python webscraper discussion

Log in or Post with

Python webscraper related posts

  • Private business directory website

    1 project | /r/selfhosted | 3 Dec 2023
  • How do you get girl clothes in secret

    1 project | /r/MtF | 12 Jul 2023
  • I have made a simple webscraper in python.pls checkout this github project.

    1 project | /r/madeinpython | 23 May 2023
  • Writing a simple Web crawler in python

    1 project | /r/u_DevGenious | 9 Apr 2023
  • Wrote an article on medium above webscraping in python

    1 project | /r/Python | 9 Apr 2023
  • Wrote a Simple webcrawler in python

    1 project | /r/PythonProjects2 | 9 Apr 2023
  • FULL GUIDE FOR EDGENUITY

    3 projects | /r/edgenuity | 2 Feb 2023
  • A note from our sponsor - Scout Monitoring
    www.scoutapm.com | 20 Sep 2024
    Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today. Learn more →

Index

What are some of the best open-source webscraper projects in Python? This list will help you:

Project Stars
1 rightmove_webscraper.py 251
2 Stocker 148
3 iSubRip 111
4 CoWin-Vaccine-Notifier 106
5 Jobs_LinkedIn 67
6 SearchifyX 64
7 scraperx 53
8 letterboxdpy 46
9 bedrock-agents-webscraper 24
10 otakuapuri 18
11 kicktipp-bot 14
12 raspberry-pi-stock-checker 13
13 web-scraper 11
14 YellowPage-scraper 9
15 ti_scraper 7
16 PotParser 5
17 jobsearch-python-webcrawler 3
18 webcrawler 2
19 CL-Checker 2
20 udemy-price-tracker 1
21 poolbooru_gelscraper 1

Sponsored
Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com

Did you konow that Python is
the 1st most popular programming language
based on number of metions?