Python HTML Manipulation

Open-source Python projects categorized as HTML Manipulation

Top 10 Python HTML Manipulation Projects

  • xmltodict

    Python module that makes working with XML feel like you are working with JSON

  • bleach

    Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes

    Project mention: What's your favorite alternative to bleach for sanitizing HTML? | /r/django | 2023-06-06

    I noticed via the changelog for Django 4.2.2 that bleach is deprecated (Django removed mention of it from their docs).

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • lxml

    The lxml XML toolkit for Python

  • pyquery

    A jquery-like library for python

  • xhtml2pdf

    A library for converting HTML into PDFs using ReportLab

  • html5lib

    Standards-compliant library for parsing and serializing HTML documents and fragments in Python

  • gazpacho

    🥫 The simple, fast, and modern web scraping library

  • WorkOS

    The modern API for authentication & user identity. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • untangle

    Converts XML to Python objects

  • MarkupSafe

    Safely add untrusted strings to HTML/XML markup.

  • xmldataset

    xmldataset: xml parsing made easy 🗃️

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-06-06.

Python HTML Manipulation related posts


What are some of the best open-source HTML Manipulation projects in Python? This list will help you:

Project Stars
1 xmltodict 5,319
2 bleach 2,596
3 lxml 2,536
4 pyquery 2,262
5 xhtml2pdf 2,141
6 html5lib 1,083
7 gazpacho 727
8 untangle 608
9 MarkupSafe 588
10 xmldataset 77
ChatGPT with full context of any GitHub repo.
Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at