Python HTML Manipulation

Open-source Python projects categorized as HTML Manipulation

Top 10 Python HTML Manipulation Projects

HTML Manipulation
  1. xmltodict

    Python module that makes working with XML feel like you are working with JSON

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. lxml

    The lxml XML toolkit for Python

    Project mention: Release 0.44.0 of Spellcheck (GitHub) Action - baby-steps maintenance | dev.to | 2024-10-25

    lxml bumped from version 4.9.3 to 5.3.0, see release notes

  4. bleach

    Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes

  5. pyquery

    A jquery-like library for python

  6. xhtml2pdf

    A library for converting HTML into PDFs using ReportLab

  7. html5lib

    Standards-compliant library for parsing and serializing HTML documents and fragments in Python

    Project mention: This Week In Python | dev.to | 2025-03-21

    html5lib-python – Standards-compliant library for parsing and serializing HTML documents and fragments in Python

  8. gazpacho

    🥫 The simple, fast, and modern web scraping library

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. MarkupSafe

    Safely add untrusted strings to HTML/XML markup.

  11. untangle

    Converts XML to Python objects

  12. xmldataset

    xmldataset: xml parsing made easy 🗃️

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python HTML Manipulation discussion

Log in or Post with

Python HTML Manipulation related posts

  • Release 0.44.0 of Spellcheck (GitHub) Action - baby-steps maintenance

    5 projects | dev.to | 25 Oct 2024
  • Show HN: Finic – open-source platform for building browser automations

    10 projects | news.ycombinator.com | 17 Sep 2024
  • GitHub – GSA/code-gov: An informative repo for all Code.gov repos

    12 projects | news.ycombinator.com | 9 Sep 2023
  • What's your favorite alternative to bleach for sanitizing HTML?

    3 projects | /r/django | 6 Jun 2023
  • 8 Most Popular Python HTML Web Scraping Packages with Benchmarks

    4 projects | dev.to | 1 Feb 2023
  • Bleach 6.0.0 Release and Deprecation

    1 project | news.ycombinator.com | 27 Jan 2023
  • XML to CSV or JSON using Cloud Function

    1 project | /r/googlecloud | 14 Dec 2022
  • A note from our sponsor - SaaSHub
    www.saashub.com | 13 Jun 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source HTML Manipulation projects in Python? This list will help you:

# Project Stars
1 xmltodict 5,619
2 lxml 2,843
3 bleach 2,704
4 pyquery 2,357
5 xhtml2pdf 2,308
6 html5lib 1,195
7 gazpacho 769
8 MarkupSafe 654
9 untangle 620
10 xmldataset 78

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?