Best web scraping framework to learn

This page summarizes the projects mentioned and recommended in the original post on /r/webscraping

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App
With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
surveyjs.io
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • crawlee

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

  • https://crawlee.dev/ its very good, you can easily run your spiders in cloud with apify, and nodejs/puppeteer has many advantages than python/selenium

  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Deep diving into Apify world

    1 project | /r/thewebscrapingclub | 2 Apr 2023
  • Playwright or Cypress?

    1 project | /r/PinoyProgrammer | 10 Feb 2023
  • Launching Crawlee Blog: Your Node.js resource hub for web scraping and automation.

    1 project | dev.to | 26 Feb 2024
  • Anything like scrapy in other languages?

    1 project | /r/webscraping | 10 Dec 2023
  • Show HN: Crawlee - Node.js的网络刮削和浏览器自动化库 (Show HN: Crawlee – The web scraping and browser automation library for Node.js)

    1 project | /r/hnzh | 23 Aug 2022