HTML Readability

Open-source HTML projects categorized as Readability

Top 4 HTML Readability Projects

  • go-readability

    Go package that cleans an HTML page for better readability.

  • ReadabiliPy

    A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-Python mode.

    Project mention: Mozilla: Readability.js | news.ycombinator.com | 2024-02-25

    I have used and love readability.js. I used it in an application that lets you run various NLP analyses over a web page (surprisals, reading time, word counts, etc.). For that, I needed only the main page content. readability.js retrieves main page content well, consistently.

    The Alan Turing Institute maintains a Python wrapper around readability.js, too: https://github.com/alan-turing-institute/ReadabiliPy.

  • Readability4J

    A Kotlin port of Mozilla's Readability. It extracts a website's relevant content and removes all clutter from it.

    Project mention: Creating an advanced search engine with PostgreSQL | news.ycombinator.com | 2023-07-12

    Depending upon the type of content, one might want to look into using Readability (the browser's reader view) to parse the webpage. It will give you all the useful info without the junk. Then you can put it in the DB as needed.

    https://github.com/mozilla/readability

    Btw, Readability is also available in a few other languages, like Kotlin:

    https://github.com/dankito/Readability4J

  • readable

    📖 A service for reading long-form content on any device

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-25.

Index

What are some of the best open-source Readability projects in HTML? This list will help you:

#  Project         Stars
1  go-readability  643
2  ReadabiliPy     179
3  Readability4J   128
4  readable        78