Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Here you can share your experience with the project you are suggesting or its comparison with trafilatura. Optional.
A valid email to send you a verification link when necessary or log in.