htmldate

Fast and robust date extraction from web pages, with Python or on the command line (by adbar)

Htmldate Alternatives

Similar projects and alternatives to htmldate

  1. readability

    56 htmldate VS readability

    A standalone version of the readability lib

  3. unclutter

    39 htmldate VS unclutter

    A modern reader mode and article library for your browser.

  4. trafilatura

    Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

  5. parser

    12 htmldate VS parser

    📜 Extract meaningful content from the chaos of a web page

  6. TWINT

    77 htmldate VS TWINT

    (Discontinued) An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

  7. html2text

    1 htmldate VS html2text

    Convert HTML to Markdown-formatted text. (by Alir3z4)

  8. micawber

    a small library for extracting rich content from urls

  10. lassie

    Web Content Retrieval for Humans™

  11. newspaper

    newspaper3k is a news, full-text, and article metadata extraction library for Python 3.

  12. dragnet

    6 htmldate VS dragnet

    Just the facts -- web page content extraction

  13. python-readability

    fast python port of arc90's readability tool, updated to match latest readability.js!

  14. toapi

    0 htmldate VS toapi

    Every web site provides APIs.

  15. arc90-readability

    A copy of the original Arc90 repo with links to many of the current ports.

  16. opengraph

    A python module to parse the Open Graph Protocol

  17. Readability4J

    A Kotlin port of Mozilla's Readability. It extracts a website's relevant content and removes all clutter from it.

  18. dom-distiller

    (Discontinued) Distills the DOM

  19. python-goose

    HTML Content / Article Extractor, web scraping lib in Python

  20. article-extraction-benchmark

    Article extraction benchmark: dataset and evaluation scripts

  21. textract

    4 htmldate VS textract

    extract text from any document. no muss. no fuss.

  22. go-htmldate

    CLI and Go package for extracting the publication date of web pages.

NOTE: The number shown for each project counts mentions in common posts plus user-suggested alternatives. Hence, a higher number indicates a better htmldate alternative or higher similarity.

htmldate reviews and mentions

Posts with mentions or reviews of htmldate. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-30.

Stats

Basic htmldate repo stats
1
124
6.9
3 months ago
