Python Beautifulsoup

Open-source Python projects categorized as Beautifulsoup

Top 23 Python Beautifulsoup Projects

  1. requests-html

    Pythonic HTML Parsing for Humans™

  3. crawlee-python

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    Project mention: How to scrape TikTok using Python | dev.to | 2025-04-30

    Which hashtags are trending now? What is an influencer's engagement rate? What topics are important for a content creator? You can find answers to these and many other questions by analyzing TikTok data. However, for analysis, you need to extract the data in a convenient format. In this blog, we'll explore how to scrape TikTok using Crawlee for Python.

  4. MechanicalSoup

    A Python library for automating interaction with websites.

    Project mention: 11 best open-source web crawlers and scrapers in 2024 | dev.to | 2024-10-29

    Language: Python | GitHub: 4.7K+ stars

  5. JobFunnel

    Scrape job websites into a single spreadsheet with no duplicates.

    Project mention: Show HN: Scraper for job listings directly from company websites | news.ycombinator.com | 2024-12-07

    jobfunnel is FOSS and accepting contributions: https://github.com/PaulMcInnis/JobFunnel

    Currently supports Indeed; in the past it supported Glassdoor and others.

  6. tiktok-downloader

    TikTok downloader/scraper using requests & bs4

  7. Senpwai

    A desktop app for tracking and batch downloading anime

  8. soupsieve

    A modern CSS selector implementation for BeautifulSoup

    Project mention: Release 0.44.0 of Spellcheck (GitHub) Action - baby-steps maintenance | dev.to | 2024-10-25

    soupsieve bumped from version 2.5 to 2.6, see release notes

  10. languagepod101-scraper

    Python scraper for Language Pods such as Japanesepod101.com. Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨

  11. WhatSoup

    A web scraper that exports your entire WhatsApp chat history.

  12. web_to_obsidian

    A Python 3 script that scrapes an HTML/XML page to extract text, then creates Markdown files for Obsidian and the Dataview plugin

  13. cf-ai-lora-news-summarizer

    Python webapp that summarizes news with Cloudflare Workers AI LoRA, Mistral, Beautifulsoup, and Streamlit

    Project mention: Summarize articles with Cloudflare Workers AI LoRAs | dev.to | 2024-07-12
  14. reddit-bots

    A collection of Reddit bots that I use to enhance the subreddits I manage.

  15. tweet-transcriber

    A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.

  16. Amazon-Product-Information-Scraper

    This Python web-scraping project retrieves product names, prices, review stars, and review counts for a specific product category.

  17. DDD

    🎧 CLI Python tool for bulk downloading Darknet Diaries podcast. Hate being online? This is the way. (by Psyhackological)

  18. audioflow

    Open Source Audio News Subscription Service (Google Trends, Hacker News & more).

    Project mention: An Open Source Audio News Subscription Service | news.ycombinator.com | 2025-03-25
  19. ScoreCast

    http://scorecast-env.eba-dixbcmhw.eu-central-1.elasticbeanstalk.com/

  20. tabroom-API

    tournaments.tech's API for scraping tabroom.com

  21. python-web-scraping-primjeri

    Web scraping of the sites posta.hr, konzum.hr, index.hr, njuskalo.hr, neostar.com, DasWeltAuto.hr, ...

  22. web-scraping-with-python

    Demonstration of Web Scraping using Selenium Python (Pytest & Pyunit) and Beautiful Soup

  23. statum

    🗺️ A data-driven website oriented around Twitch. Written in Python + Flask, with MongoDB.

  24. python_portfolio_web_scraper-spotrac

    Python solution to webscrape contract data from https://www.spotrac.com

  25. flannelfynet

    find your fantano score.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Beautifulsoup related posts

  • How to scrape Google Maps data using Python and Crawlee

    2 projects | dev.to | 30 Dec 2024
  • How to scrape Google search results with Python

    2 projects | dev.to | 1 Dec 2024
  • How to scrape infinite scrolling webpages with Python

    2 projects | dev.to | 27 Aug 2024
  • How to scrape a website with Python (Beginner tutorial)

    1 project | dev.to | 22 Feb 2024
  • Continued analysis of the used car market: new cars are too expensive, and have used car prices risen too?

    1 project | /r/croatia | 14 Apr 2023
  • A short analysis of electric cars on Njuškalo - details in the comment

    1 project | /r/croatia | 5 Apr 2023
  • flannelfy.net update - LastFM, All Scores

    1 project | /r/fantanoforever | 28 Mar 2023

Index

What are some of the best open-source Beautifulsoup projects in Python? This list will help you:

 #  Project                                Stars
 1  requests-html                         13,806
 2  crawlee-python                         5,638
 3  MechanicalSoup                         4,752
 4  JobFunnel                              2,010
 5  tiktok-downloader                        321
 6  Senpwai                                  246
 7  soupsieve                                236
 8  languagepod101-scraper                   160
 9  WhatSoup                                 141
10  web_to_obsidian                           54
11  cf-ai-lora-news-summarizer                27
12  reddit-bots                               25
13  tweet-transcriber                         18
14  Amazon-Product-Information-Scraper        15
15  DDD                                       13
16  audioflow                                 13
17  ScoreCast                                 11
18  tabroom-API                               11
19  python-web-scraping-primjeri               6
20  web-scraping-with-python                   5
21  statum                                     5
22  python_portfolio_web_scraper-spotrac       5
23  flannelfynet                               4
