Python Beautifulsoup

Open-source Python projects categorized as Beautifulsoup

Top 23 Python Beautifulsoup Projects

  • requests-html

    Pythonic HTML Parsing for Humans™

    Project mention: will requests-html library work as selenium | /r/Python | 2023-02-13
  • MechanicalSoup

    A Python library for automating interaction with websites.

    Project mention: Alternatives to Selenium? | /r/pythontips | 2022-07-21

    Try with Mechanicalsoup https://mechanicalsoup.readthedocs.io/en/stable/

  • InfluxDB

    Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.

  • JobFunnel

    Scrape job websites into a single spreadsheet with no duplicates.

  • http-proxy-list

    It is a lightweight project that, every 10 minutes, scrapes lots of free-proxy sites, validates if it works, and serves a clean proxy list.

    Project mention: Proxylist Sources | /r/privatepub | 2023-02-15
  • soupsieve

    A modern CSS selector implementation for BeautifulSoup

  • tiktok-downloader

    Tiktok Downloader/Scraper using requests & bs4

    Project mention: Anyone know how to bulk download a tiktok profile with no watermark and in HD? | /r/DataHoarder | 2023-05-30
  • languagepod101-scraper

    Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨

  • Sonar

    Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

  • WhatSoup

    A web scraper that exports your entire WhatsApp chat history.

  • web_to_obsidian

    A Python 3 script that scrapes an html/xml page to extract text, then creates markdown files for Obsidian & the dataview plugin

    Project mention: I got the job! Now using Obsidian to help create databases for my company | /r/ObsidianMD | 2023-03-25

    Made a Github page last time I posted this: https://github.com/Flybell/web_to_obsidian

  • reddit-bots

    A collection of Reddit bots that I use to enhance the subreddits I manage.

    Project mention: Any ideas? I need a bot to grab comments from a reddit post and put them on github repository | /r/github | 2023-01-05

    It's not what you're directly looking for but as an example and starting point I'd check out https://github.com/PhantomInsights/reddit-bots

  • tweet-transcriber

    A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.

  • DDD

    The CLI Python module for bulk downloading the Darknet Diaries podcast to a hard disc. Hate being online all the time? This is the way. (by Psyhackological)

  • PythonAutomateCybersecurity

    Course covering Task Automation and Cyber Security in Python

  • Letterboxd-friend-ranker

    Program that computes, ranks a given user and their friends based on Letterboxd ratings

    Project mention: Letterboxd Profile Analyzer, Friends Ranker, and Simple Movie Recommender App | /r/Letterboxd | 2023-03-28

    The loading bar actually comes with Streamlit and they have each of their component documentations on their page, you can access my source code for the app on my GitHub here (check on deployment.py, search for "st.spinner" and "st.progress"). For additional data of each movie, maybe you missed that part on my Profile Analysis article (the scraping movie details part). Yes, I modified scraping functions from this repo and I mentioned it on my Friends Ranker article.

  • Amazon-Product-Information-Scraper

    This is a python web-scraping project to get all the product names, price, review stars and review count of a particular category of the product

  • tabroom-API

    tournaments.tech's API for scraping tabroom.com

    Project mention: Debate Land Beta 0.2 is out! | /r/Debate | 2023-06-03

    Now does a data site for high school debate need all of that? Probably no. But we wanted to create something stunning. Our goal for Debate Land is to create something MaxPreps would be jealous of. And we've never really had a problem with the UI/UX from our feedback—our KPIs have jumped significantly since the redesign from tournaments.tech (which had a more simple layout and design).

  • statum

    🗺️ statum, a Twitch streamer-related website. Written in Python + Flask, with MongoDB. Current features include Twitch OAuth integration, personalized dashboard, unique streamer insights & much more.

  • weheartpy

    A fast, reliable API wrapper for weheartit.com

    Project mention: weheartpy: an API client for weheartit.com | /r/madeinpython | 2022-08-20

    Github

  • python-web-scraping-primjeri

    web scraping stranica posta.hr, konzum.hr, index.hr, njuskalo.hr, neostar.com, DasWeltAuto.hr, ...

    Project mention: Nastavak analize tržišta rabljenih auta - novi auti su preskupi, a porasle su cijene i rabljenima ? | /r/croatia | 2023-04-14

    Nastavak na ovu OBJAVU. Skripte su OVDJE, zajedno sa sirovim excelicama. Za potrebe ove objave, radio sam dorađenu excelicu. Varijante modela sam spajao, kako bi imao veći uzorak pojedinog modela. Npr karavan, hatchback, sedan - sve ide pod 1 model.

  • python_portfolio_web_scraper-spotrac

    Python solution to webscrape contract data from https://www.spotrac.com

    Project mention: Export Player Contracts | /r/fantasyfootballcoding | 2022-06-11

    Not sure if you know python or not but it looks like you could scrape each team page using python: https://github.com/the-data-analyst/python_portfolio_web_scraper-spotrac/

  • israbrew

    Beers from various suppliers across the state scraped onto one website

  • flannelfy_os

    A site that allows the user to have their Spotify library reviewed by music critic Anthony Fantano.

    Project mention: flannelfy.net update - LastFM, All Scores | /r/fantanoforever | 2023-03-28

    Find your score: flannelfy.net

  • GSOC_org_analysis

    Welcome to my first full project!

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-06-03.

Python Beautifulsoup related posts

Index

What are some of the best open-source Beautifulsoup projects in Python? This list will help you:

Project Stars
1 requests-html 13,168
2 MechanicalSoup 4,393
3 JobFunnel 1,635
4 http-proxy-list 274
5 soupsieve 169
6 tiktok-downloader 163
7 languagepod101-scraper 132
8 WhatSoup 95
9 web_to_obsidian 32
10 reddit-bots 21
11 tweet-transcriber 19
12 DDD 12
13 PythonAutomateCybersecurity 11
14 Letterboxd-friend-ranker 10
15 Amazon-Product-Information-Scraper 6
16 tabroom-API 6
17 statum 4
18 weheartpy 3
19 python-web-scraping-primjeri 2
20 python_portfolio_web_scraper-spotrac 1
21 israbrew 1
22 flannelfy_os 0
23 GSOC_org_analysis 0
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com