JavaScript web-scraping

Open-source JavaScript projects categorized as web-scraping

Top 15 JavaScript web-scraping Projects

web-scraping
  1. patchright

    Undetected version of the Playwright testing and automation library.

    Project mention: Closer to the Metal: Leaving Playwright for CDP | news.ycombinator.com | 2025-08-20

    Chromium is the only browser that has extensive undetectable/automation support. Look at patchright: https://github.com/Kaliiiiiiiiii-Vinyzu/patchright?tab=readm...

  2. Sevalla

    Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!

    Sevalla logo
  3. brightdata-mcp

    A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

    Project mention: [Live Demo] Automate a Viral Newsletter Using n8n + Real-Time Web Data | dev.to | 2025-08-25

    Here is the recording from my livestream with Rafael Levi from Bright Data. In the stream, we show you how to create a daily newsletter on the topics you care about powered by real-time data from your favorite sources.

  4. rebrowser-patches

    Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.

    Project mention: Rebrowser Patches – Patches for undetectable browser automation | news.ycombinator.com | 2025-04-25
  5. single-file-cli

    CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)

    Project mention: SingleFile – Effortlessly Save and Preserve Web Pages | news.ycombinator.com | 2025-07-06
  6. Netflix-Clone

    Netflix like full-stack application with SPA client and backend implemented in service oriented architecture (by yuchiu)

  7. html-metadata

    MetaData html scraper and parser for Node.js (supports Promises and callback style)

    Project mention: HTML-metadata – a Node.js library that extracts metadata from HTML pages | news.ycombinator.com | 2024-10-16
  8. raise

    A simple (and unofficial) GitHub Trending client that lives in your menubar.

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. agentql-mcp

    Model Context Protocol server that integrates AgentQL's data extraction capabilities.

    Project mention: 11 Practical Ways I Use AI Agents Without Losing My Authenticity | dev.to | 2025-04-21

    Enable the AgentQL extension

  11. Oxylabs

    Web scraping, data parsing and automation tutorials.

  12. node-google-reviews-web-scraper

  13. planit

    Assignment tracker for students that automatically compiles due dates into an interactive calendar

  14. Kraker-Local-Proxy-Server

    A local proxy server based on Node.js for use with desktop web browsers. Designed for web scraping. Includes HTTP, HTTPS and Socks5 ports with integrated DNS and DNS-over-HTTPS.

    Project mention: Ask HN: 1000+ Visitors, how to get more conversions on your SaaS? | news.ycombinator.com | 2024-09-21

    I've been asking the same question myself though I'm not trying to get paying customers. I'm offering free stuff but I can't get many takers. One customer per 1000 views is pretty much the best you can expect. You need a million views to snag 1000 customers. I would suggest that you stick to your plan and go through the free routes first. This will help you to gauge the level of interest before you bankrupt yourself paying for marketing (which never lives up to expectations).

    I do have an anecdote. I have a repository on the Internet Archive. One of my items had just a few hundred views but, all of a sudden, it accumulated over 3000 views. I posted a comment on the item asking what sparked the sudden interest. Someone replied that it was a user comment on Youtube. Makes me wonder if I'm not making a huge mistake by not registering on Youtube so I can post self-promotional comments everywhere.

    Obligatory shameless plug is below.

    https://8chananon.github.io

  15. Web-Scraper

    Simple Web scraping app to scrape all the Indian Presidents (Name and Birthdays) present on Wikipedia. (by Garima-sharma814)

  16. cloudflare-bypass

    This repository provides minimal working examples for bypassing Cloudflare 1020 errors using Playwright in both Python and Node.js. The focus is on showing basic setups to load pages that are often protected by anti-bot measures, with step-by-step improvements like stealth plugins, proxy rotation, and human-like behaviors. (by HasData)

    Project mention: Node.js + Playwright Stealth: Bypass Cloudflare 1020 in 5 Minutes | dev.to | 2025-06-25

    For Python fans, check our GitHub repo. Node.js examples are there too. If it helped, give it a star.

  17. google-maps-scraper

    Scrape Google Maps search results using Selenium, Playwright (stealth), or HasData API in Python & Node.js. Extract names, ratings, reviews, and more - export to CSV/JSON. (by HasData)

    Project mention: Simple Google Maps Scraper Using Playwright | dev.to | 2025-08-06

    How to Scrape Google Maps Data Using Python Github repo with examples on Python and NodeJS Join our Discord

  18. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

JavaScript web-scraping discussion

Log in or Post with

JavaScript web-scraping related posts

  • Introducing Web Scraping with Scrapy.js

    1 project | dev.to | 19 Oct 2021
  • Rabibashoriyo Short stories in one place :)

    1 project | /r/westbengal | 11 May 2021

Index

What are some of the best open-source web-scraping projects in JavaScript? This list will help you:

# Project Stars
1 patchright 1,402
2 brightdata-mcp 1,162
3 rebrowser-patches 1,001
4 single-file-cli 950
5 Netflix-Clone 269
6 html-metadata 175
7 raise 149
8 agentql-mcp 104
9 Oxylabs 64
10 node-google-reviews-web-scraper 23
11 planit 10
12 Kraker-Local-Proxy-Server 10
13 Web-Scraper 4
14 cloudflare-bypass 3
15 google-maps-scraper 0

Sponsored
Deploy and host your apps and databases, now with $50 credit!
Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
sevalla.com