python-readability

Fast Python port of arc90's readability tool, updated to match the latest readability.js. (by buriy)

Python-readability Alternatives

Similar projects and alternatives to python-readability

  1. pandoc

    Universal markup converter

  2. omnivore

    Omnivore is a complete, open source read-it-later solution for people who like reading.

  3. llm

    Access large language models from the command-line

  4. markdownload

    A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.

  5. parser

    📜 Extract meaningful content from the chaos of a web page

  6. to-markdown

    🛏 An HTML to Markdown converter written in JavaScript

  7. tidy-html5

    The granddaddy of HTML tools, with support for modern standards

  8. html2text

    Convert HTML to Markdown-formatted text. (by Alir3z4)

  9. newspaper

    newspaper3k: news, full-text, and article metadata extraction in Python 3.

  10. webscrapbook

    A browser extension that captures web pages to a local device or backend server for future retrieval, organization, annotation, and editing. This project inherits from the legacy Firefox add-on ScrapBook X.

  11. python-goose

    HTML Content / Article Extractor, web scraping lib in Python

  12. sanitize

    Bringing sanity to world of messed-up data

  13. metascraper

    Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.

  14. opengraph

    A Python module to parse the Open Graph Protocol

  15. sumy

    Module for automatic summarization of text documents and HTML pages.

  16. textract

    extract text from any document. no muss. no fuss.

  17. easy-astro-blog-creator

    An easy personal blog template for Github Pages.

  18. toapi

    Every web site provides APIs.

  19. scrapedown

    A simple worker for extracting page content for a given URL

NOTE: The number of mentions for each entry counts mentions in common posts plus user-suggested alternatives. A higher number therefore indicates a better python-readability alternative or a more similar project.

python-readability reviews and mentions

Posts with mentions or reviews of python-readability. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-14.

Stats

Basic python-readability repo stats
Mentions: 5
Stars: 2,768
Activity: 7.2
Last commit: 9 days ago
