A reimplementation of the Readability/Decruft algorithm using BeautifulSoup and html5lib
Why do you think that https://github.com/scrapinghub/article-extraction-benchmark is a good alternative to soup-strainer
A reimplementation of the Readability/Decruft algorithm using BeautifulSoup and html5lib
Why do you think that https://github.com/scrapinghub/article-extraction-benchmark is a good alternative to soup-strainer