Python beautifulsoup4

Open-source Python projects categorized as beautifulsoup4

Top 23 Python beautifulsoup4 Projects

  1. JobFunnel

    Scrape job websites into a single spreadsheet with no duplicates.

    Project mention: Show HN: Scraper for job listings directly from company websites | news.ycombinator.com | 2024-12-07

    jobfunnel is FOSS and accepting contributions: https://github.com/PaulMcInnis/JobFunnel

    Currently supports Indeed; in the past it supported Glassdoor and others.

  2. PornHub-downloader-python

    Download stuff from PH the easy way.

  3. facebook-post-scraper

    Facebook Post Scraper 🕵️🖱️

  4. scrape-google-scholar-py

    Extract data from all Google Scholar pages from a single Python module. NOTE: I'm no longer maintaining this repo. Chrome driver/selectors might need an update.

  5. Quest

    This is a web app that integrates GPT-3 with Google searches (by farrael004)

  6. AmazonMe

    Introducing AmazonMe, a Python-based web scraper designed to extract data from amazon.com using the requests and BeautifulSoup libraries. It simplifies navigation and makes it easy to gather information from Amazon's website efficiently.

  7. amazon_wishlist_pricewatch

    Periodically check your public Amazon wishlist for price reductions.

  8. permaculture

    Permaculture design app built on scraped plant databases. Drag-n-drop GUI with detailed design plan generator.

  9. cf-ai-lora-news-summarizer

    Python webapp that summarizes news with Cloudflare Workers AI LoRA, Mistral, BeautifulSoup, and Streamlit

    Project mention: Summarize articles with Cloudflare Workers AI LoRAs | dev.to | 2024-07-12

  10. supremebot

    SupremeBot is a user-friendly bot built with NiceGUI to help you buy limited-edition Supreme items. It offers real-time item updates, a hype counter, and fast checkout with pre-filled forms. Available on Windows, macOS, and Linux for seamless Supreme shopping.

    Project mention: 🚀 Introducing Supreme Bot: A Python-Based Web Automation Tool 🛒 | dev.to | 2025-03-15

    How You Can Help: 🌱 Contribute code: Fix bugs, add features, or refactor existing parts. 💬 Provide feedback: Let me know your thoughts and suggestions! 📈 Help grow the community: Share the project with others who might be interested. Check out the repository: Supreme Bot GitHub

  11. rango

    Telegram bot to download torrents. (by kaushalpurohit)

  12. telexkcdbot

    A functional asynchronous Telegram bot for conveniently reading xkcd comics. https://t.me/telexkcdbot

  13. larentals

    An interactive map of for-sale & rental property listings in Los Angeles County, updated weekly.

  14. simple-web-scraper

    Simple web scraper to get player data using beautifulsoup4, with PostgreSQL as the database and SQLAlchemy as the ORM.

  15. dicer

    Web scraper to scrape job data from www.dice.com

  16. hu-announcement-bot

    Get the latest from Hacettepe with this amazing Telegram Bot!

  17. reactjs-docs-ebook

    Exports React docs as EPUB ebook.

  18. HackerNEWS-Simplified

    A more simplified, straightforward, and plain version of Hacker News.

  19. web-scraping-with-python

    Demonstration of Web Scraping using Selenium Python (Pytest & Pyunit) and Beautiful Soup

  20. acgn-bot

    Telegram bot that checks anime/comic/game/novel websites for updates.

  21. beautifulday

    Learning project for scraping weather from weather.gc.ca. Print out simple or extended weather reports for any Canadian city to a console.

  22. goodreads2libby

    Search your Libby libraries for your books on GoodReads

  23. yellowpage-scraper

    Created to scrape yellowpages.com (by Bibekbd)

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python beautifulsoup4 related posts

  • I create my first webscraping for yellowpages.com

    2 projects | /r/webscraping | 16 Jun 2023
  • New L.A. County rental listings, week of 6-12-2023

    1 project | /r/LARentals | 12 Jun 2023
  • New L.A. County rental listings, week of 6-12-2023

    1 project | /r/LAlist | 12 Jun 2023
  • We will NOT be participating in the blackout.

    1 project | /r/LARentals | 11 Jun 2023
  • Looking for 2 bedroom apartment/condo in West-East Hollywood.

    1 project | /r/LARentals | 8 Jun 2023
  • Looking for a rental apartment

    1 project | /r/LARentals | 11 May 2023
  • Scrape Google Scholar in R

    2 projects | dev.to | 6 May 2023

Index

What are some of the best open-source beautifulsoup4 projects in Python? This list will help you:

# Project Stars
1 JobFunnel 2,012
2 PornHub-downloader-python 821
3 facebook-post-scraper 330
4 scrape-google-scholar-py 101
5 Quest 74
6 AmazonMe 64
7 amazon_wishlist_pricewatch 29
8 permaculture 27
9 cf-ai-lora-news-summarizer 27
10 supremebot 24
11 rango 23
12 telexkcdbot 23
13 larentals 22
14 simple-web-scraper 9
15 dicer 8
16 hu-announcement-bot 8
17 reactjs-docs-ebook 6
18 HackerNEWS-Simplified 6
19 web-scraping-with-python 5
20 acgn-bot 4
21 beautifulday 3
22 goodreads2libby 3
23 yellowpage-scraper 2

