html5lib VS harfbuzz

Compare html5lib vs harfbuzz and see what are their differences.

html5lib

Standards-compliant library for parsing and serializing HTML documents and fragments in Python (by html5lib)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
html5lib harfbuzz
3 33
1,095 3,581
0.9% 3.5%
4.1 9.8
about 2 months ago 1 day ago
Python C++
MIT License GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

html5lib

Posts with mentions or reviews of html5lib. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-11-25.
  • Bleach 6.0.0 Release and Deprecation
    1 project | news.ycombinator.com | 27 Jan 2023
    Yes. This is really interesting.

    Sounds like html5lib has been asking for funding, but doesn't look like there's much progress. https://github.com/html5lib/html5lib-python/issues/361

  • Pydantic Factories
    2 projects | /r/Python | 25 Nov 2021
    Neither did html5lib.
  • Why are circular dependencies even a thing?
    3 projects | /r/linuxquestions | 25 Sep 2021
    Easier example...sphinx is a document generator for python programs (creating docs for the API of programs from source-code comments for example). Spinx depends on html5lib which itself again depends on six...want to make a guess what six uses to generate its API docs? ;) So if you want the api docs of six you will have to first install it without to be able to get a working sphinx install then redo the six on including the building of the API docs.

harfbuzz

Posts with mentions or reviews of harfbuzz. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-20.
  • HarfBuzz: Text Shaping Engine
    1 project | news.ycombinator.com | 17 Apr 2024
  • Rive Renderer – now open source and available on all platforms
    7 projects | news.ycombinator.com | 20 Mar 2024
  • Libsodium: A modern, portable, easy to use crypto library
    9 projects | news.ycombinator.com | 14 Sep 2023
    For C/C++ projects that use meson as the build system, there is an excellent way to manage dependencies:

    https://mesonbuild.com/Wrapdb-projects.html

    https://mesonbuild.com/Wrap-dependency-system-manual.html

    meson will download and build the libraries automatically and give you a variable which you pass as a regular dependency into the built target:

    https://github.com/qemu/qemu/tree/005ad32358f12fe9313a4a0191...

    https://github.com/harfbuzz/harfbuzz/tree/main/subprojects

    https://github.com/harfbuzz/harfbuzz/blob/37457412b3212463c5...

    Or, if you're using proper operating systems, they're managed by the usual package manager, just like everything else.

  • The Web Assembly Shaper
    1 project | /r/hypeurls | 9 Jul 2023
    2 projects | news.ycombinator.com | 9 Jul 2023
  • Text Rendering Hates You
    5 projects | news.ycombinator.com | 26 Jun 2023
    If you sympathize with the travails of people working on text rendering in applications, please consider supporting (among other projects):

    1. The LibreOffice project (libreoffice.org), the free office application suite. This is where the rubber hits the road and developers deal with the extreme complexities of everything regarding text - shaping, styling, multi-object interaction, multi-language, you name it. And - they/we absolutely need donations to manage a project with > 200 million users: https://www.libreoffice.org/donate

    2. harfbuzz (https://harfbuzz.github.io), and specifically Behdad Esfahood the main contributor. Although, TBH, I've not quite figured out whether you can donate to that or to him. At least star the project on GitHub I guess.

  • ImGui or text rendering libraries
    7 projects | /r/C_Programming | 6 Apr 2023
    As for text, it depends very heavily on what exactly you need. Simple ASCII text and bitmap fonts? Just do it yourself or get a .bdf parser. Simple Latin/Cyrillic-like writing with ok-looking vector fonts (ttfs)? stb_truetype has all you need. Font hinting, subpixel rendering? You use freetype. More complex writing like Arabic? You will have to do shaping as well, say with HarfBuzz. Need right-to-left or unidirectional text? Hypenation? Go for platform APIs if you can (DirectWrite om Windows, CoreText on Mac).
  • QuestPDF: Modern .NET library for PDF document generation
    14 projects | news.ycombinator.com | 18 Jan 2023
    Gold standard? Even though serious bugs are not fixed [1] because "the code is too fragile to touch at this point"? Looks like Android uses HarfBuzz, if so it can't be that bad.

    [1] https://github.com/harfbuzz/harfbuzz/issues/2814

  • A Programmable Markup Language for Typesetting [pdf]
    9 projects | news.ycombinator.com | 18 Jan 2023
    The linked libraries are not even close to solving limited subsets of problems solved by FreeType or HarfBuzz. No test is needed if they do not even have a working implementation of particular requisites: Do they work on heterogeneous layouts, directions, languages, locales, scripts, symbols and composites, extensions, variations, legacy, missing, partial or corrupted instructions, standards interpretations, platforms, output devices, nonstandard point structures and grids?

    They do not. What they solve is almost a toy problem compared to the size, scope and breadth of these libraries.

    Just because some project is implemented in Rust does not make it comparable never mind superior by default.

    There is a world out there and it is not homogeneous format and standards-compliant Latin fonts in English LTR text in linear disposition with some generic rectangular subpixel rendering on a regular rectangular grid.

    I warmly welcome you to browse closed issues of FreeType [1] and also the closed issues of HarfBuzz [2]. If you feel inspired please do also look into mailing lists and discussion pages related to the development, building, tracking and patching of packages of these projects in any of the numerous places it is used.

    The only argument Rust people have is in relation WASM but if you insist in targeting WASM why not fork FreeType, strip it to the strict subset of features your application needs and target it?

    Why do it in the first place? Why reinvent the wheel?

    As such I will restate my view: I see no gain in using any of these subpar libraries.

    [1] https://gitlab.freedesktop.org/freetype/freetype/-/issues/?s...

    [2] https://github.com/harfbuzz/harfbuzz/issues?q=is%3Aclosed

  • Harfbuzz 6.0
    1 project | news.ycombinator.com | 31 Dec 2022

What are some alternatives?

When comparing html5lib and harfbuzz you can also consider the following projects:

lxml - The lxml XML toolkit for Python

imgui-sfml - Dear ImGui backend for use with SFML

xhtml2pdf - A library for converting HTML into PDFs using ReportLab

nanovg - Antialiased 2D vector drawing library on top of OpenGL for UI and visualizations.

bleach - Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes

contour - Modern C++ Terminal Emulator

selectolax - Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).

c-ares - A C library for asynchronous DNS requests

pyquery - A jquery-like library for python

imgui_sdl - ImGuiSDL: SDL2 based renderer for Dear ImGui

gazpacho - 🥫 The simple, fast, and modern web scraping library

Tehreer-Android - Standalone text engine for Android aimed to be free from platform limitations