Most universal way to get web article text?

This page summarizes the projects mentioned and recommended in the original post on /r/learnpython.

  • readability

    A standalone version of the readability lib

  • If you can read some JavaScript, have a look at Mozilla's readability module, which Firefox uses to extract content for Reader View. It's probably easiest to start at the parse() method and work your way back from there. The code is heavily commented, so even if you're not well versed in JS you should be able to follow the general logic.
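
    For readers who would rather see the general idea in Python, here is a minimal, standard-library-only sketch of a readability-style heuristic: score each paragraph by its length and comma count, credit the score to the paragraph's parent element, and keep the paragraphs of the best-scoring parent. This is not a port of Mozilla's code. The names `ParagraphScorer`, `extract_article_text`, `SKIP_TAGS`, and `SAMPLE_HTML` are hypothetical, and the thresholds and skipped tags are assumptions chosen for illustration.

```python
from html.parser import HTMLParser

# Assumption: containers that rarely hold article text are skipped outright.
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside"}
# Void elements never get a closing tag, so they stay off the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ParagraphScorer(HTMLParser):
    """Toy scorer: rate each <p> and credit the score to its parent element."""

    def __init__(self):
        super().__init__()
        self.stack = []       # (tag, node_id) for every open element
        self.next_id = 0
        self.p_id = None      # node_id of the <p> being read, if any
        self.p_parent = None  # node_id of that paragraph's parent
        self.buffer = []
        self.scores = {}      # parent node_id -> accumulated score
        self.paragraphs = {}  # parent node_id -> paragraph texts

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if tag == "p" and self.p_id is not None:
            self.handle_endtag("p")  # a new <p> implicitly closes the open one
        skipped = any(t in SKIP_TAGS for t, _ in self.stack)
        if tag == "p" and not skipped:
            self.p_id = self.next_id
            self.p_parent = self.stack[-1][1] if self.stack else -1
            self.buffer = []
        self.stack.append((tag, self.next_id))
        self.next_id += 1

    def handle_endtag(self, tag):
        if all(t != tag for t, _ in self.stack):
            return  # stray closing tag, nothing to pop
        while self.stack:
            open_tag, node_id = self.stack.pop()
            if node_id == self.p_id:
                self._finish_paragraph()
            if open_tag == tag:
                break

    def handle_data(self, data):
        if self.p_id is not None:
            self.buffer.append(data)

    def _finish_paragraph(self):
        text = " ".join("".join(self.buffer).split())  # collapse whitespace
        parent, self.p_id = self.p_parent, None
        if len(text) < 25:
            return  # assumption: very short paragraphs are not article text
        # One base point, one per comma, one per 100 characters (capped at 3).
        score = 1 + text.count(",") + min(len(text) // 100, 3)
        self.scores[parent] = self.scores.get(parent, 0) + score
        self.paragraphs.setdefault(parent, []).append(text)


def extract_article_text(html):
    """Return the paragraphs of the best-scoring container, or '' if none."""
    parser = ParagraphScorer()
    parser.feed(html)
    parser.close()
    if parser.p_id is not None:
        parser._finish_paragraph()  # document ended inside an open <p>
    if not parser.scores:
        return ""
    best = max(parser.scores, key=parser.scores.get)
    return "\n\n".join(parser.paragraphs[best])


SAMPLE_HTML = """
<html><body>
<nav><p>Home, About, Contact, Blog, Archive, and more links here</p></nav>
<div id="sidebar"><p>Subscribe to our newsletter today!</p></div>
<div id="content">
<p>Extracting the main text of a web page is harder than it looks,
because navigation, ads, and footers surround it.</p>
<p>A common heuristic is to score paragraphs by length and punctuation,
then keep the best container.</p>
</div>
</body></html>
"""

if __name__ == "__main__":
    print(extract_article_text(SAMPLE_HTML))
```

    A real extractor does far more than this (class/id weighting, link-density checks, cleanup of the chosen node), so for anything beyond experimenting it is probably better to rely on an existing readability implementation than to grow this sketch.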

