TypeScript Puppeteer

Open-source TypeScript projects categorized as Puppeteer

Top 23 TypeScript Puppeteer Projects

  1. crawlee

    Crawlee: a web scraping and browser automation library for Node.js to build reliable crawlers, in JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Supports both headful and headless modes, with proxy rotation.

    Project mention: Scraperr – A Self Hosted Webscraper | news.ycombinator.com | 2025-05-11

    If you're a fan of Playwright check out Crawlee [0]. I've used it for a few small projects and it's been faster for me to get what I've needed done.

    [0] https://crawlee.dev/

  3. stagehand

    An AI web browsing framework focused on simplicity and extensibility.

    Project mention: Anthropic launches its own $200 plan | news.ycombinator.com | 2025-04-09

    Depends how you define "agentic approach"? With something like smol agents + stagehand you can do pretty much anything while being provider agnostic.

    https://github.com/browserbase/stagehand

  4. browserless

    Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.

  5. llm-scraper

    Turn any webpage into structured data using LLMs

    Project mention: Scraperr – A Self Hosted Webscraper | news.ycombinator.com | 2025-05-11

    llm-scraper [1] does a decent job but it's still a bit fragile. The biggest problem I have is all the React CSS-in-JS libraries that use hashes in their class names, which the LLM isn't smart enough to ignore.

    [1] https://github.com/mishushakov/llm-scraper

  6. jest-puppeteer

    Run tests using Jest & Puppeteer 🎪✨

  7. puppeteer-cluster

    Puppeteer Pool, run a cluster of instances in parallel

  8. chrome-aws-lambda

    Chromium Binary for AWS Lambda and Google Cloud Functions

  10. pwa-asset-generator

    Automates PWA asset generation and image declaration. Automatically generates icon and splash screen images, favicons and mstile images. Updates manifest.json and index.html files with the generated images according to Web App Manifest specs and Apple Human Interface guidelines.

  11. x-crawl

    x-crawl is a flexible Node.js multifunctional crawler library. Flexible usage and numerous functions can help you quickly, safely, and stably crawl pages, interfaces, and files.

  12. fingerprint-suite

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    Project mention: Browser fingerprinting tools for anonymizing scrapers | news.ycombinator.com | 2024-12-05
  13. md-to-pdf

    Hackable CLI tool for converting Markdown files to PDF using Node.js and headless Chrome.

  14. Spearmint

    Testing, simplified. || An inclusive, accessibility-first GUI for generating clean, semantic JavaScript tests in only a few clicks of a button. (by open-source-labs)

  15. BotD

    Bot detection library that runs in the browser. Detects automation tools and frameworks. No server required, runs 100% on the client. MIT license, no usage restrictions.

  16. replay

    Library that provides an API to replay and stringify recordings created using Chrome DevTools Recorder (by puppeteer)

  17. adblocker

    Efficient embeddable adblocker library

    Project mention: Ask HN: How to remove Ads from a downloaded HTML file to output an ad free file? | news.ycombinator.com | 2024-11-08

    Do you have to use Curl? It wouldn't render a lot of sites correctly anyway (anything that uses JS for rendering).

    Can you run a puppeteer/playwright instance and add an ad blocker to that? e.g. https://github.com/ghostery/adblocker or https://github.com/microsoft/playwright-python/issues/782

  18. secret-agent

    The web scraper that's nearly impossible to block - now called @ulixee/hero

  19. linvo-scraper

    LinkedIn automation bot with every possible scraping! Valid for 2022, used by Linvo.io

  20. Recorder

    A browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨ (by DeploySentinel)

  21. deno-puppeteer

    A port of puppeteer running on Deno

  22. astral

    A high-level puppeteer/playwright-like library for Deno (by lino-levan)

  23. Twitch-Drops-Bot

    A Node.js bot that will automatically watch Twitch streams and claim drop rewards.

  24. puppeteer-ide-extension

    Standalone puppeteer playground in browser's developer tools.

  25. puppeteer-report

    Convert HTML to PDF by Puppeteer with support of adding a custom header, footer, and page number

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).
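
The llm-scraper mention above points out that hashed CSS-in-JS class names tend to confuse LLM-based extraction. One possible workaround, sketched here under stated assumptions, is to strip class tokens that look machine-generated before handing the HTML to a model. The function names and the regular expressions below are hypothetical and only illustrate the idea; they are not part of llm-scraper or any other project on this list, and real generated class names vary by library and build configuration.

```typescript
// Heuristic patterns for generated class names.
// These are illustrative assumptions, not an exhaustive or authoritative list.
const HASHED_CLASS_PATTERNS: RegExp[] = [
  /^css-[a-z0-9]+$/i, // Emotion-like tokens, e.g. "css-1q2w3e"
  /^sc-[a-z0-9]+$/i,  // styled-components-like tokens, e.g. "sc-bdVaJa"
  /^jss\d+$/,         // JSS-like tokens, e.g. "jss42"
];

// Returns true when a single class token matches one of the heuristics.
function isHashedClass(token: string): boolean {
  return HASHED_CLASS_PATTERNS.some((pattern) => pattern.test(token));
}

// Removes hashed-looking tokens from every double-quoted class attribute.
// If no tokens remain, the whole attribute is dropped.
function stripHashedClassNames(html: string): string {
  return html.replace(/\sclass="([^"]*)"/g, (_match: string, value: string) => {
    const kept = value
      .split(/\s+/)
      .filter((token) => token.length > 0 && !isHashedClass(token));
    return kept.length > 0 ? ` class="${kept.join(" ")}"` : "";
  });
}

const sample = '<div class="css-1q2w3e card"><span class="sc-bdVaJa">Hi</span></div>';
console.log(stripHashedClassNames(sample));
```

A regex pass like this only handles double-quoted `class` attributes in already-serialized HTML. When a Puppeteer or Playwright page is available, doing the same filtering on the live DOM would likely be more robust.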

TypeScript Puppeteer related posts

  • Scraperr – A Self Hosted Webscraper

    6 projects | news.ycombinator.com | 11 May 2025
  • Anthropic launches its own $200 plan

    1 project | news.ycombinator.com | 9 Apr 2025
  • 🚀 Why Single-Page Applications (SPAs) Are Taking Over the Web – Are You Ready?

    1 project | dev.to | 25 Mar 2025
  • 5 Free Open-Source Tools to Make Your Next Project Easier

    3 projects | dev.to | 26 Dec 2024
  • Browser fingerprinting tools for anonymizing scrapers

    1 project | news.ycombinator.com | 5 Dec 2024
  • Stagehand

    1 project | news.ycombinator.com | 2 Nov 2024
  • How and why we ripped our Open Source product apart for a full rebuild

    1 project | dev.to | 28 Feb 2024

Index

What are some of the best open-source Puppeteer projects in TypeScript? This list will help you:

 #  Project                   Stars
 1  crawlee                  17,672
 2  stagehand                11,805
 3  browserless              10,113
 4  llm-scraper               4,863
 5  jest-puppeteer            3,545
 6  puppeteer-cluster         3,387
 7  chrome-aws-lambda         3,270
 8  pwa-asset-generator       2,838
 9  x-crawl                   1,712
10  fingerprint-suite         1,408
11  md-to-pdf                 1,359
12  Spearmint                 1,281
13  BotD                      1,217
14  replay                    1,156
15  adblocker                   888
16  secret-agent                705
17  linvo-scraper               617
18  Recorder                    467
19  deno-puppeteer              458
20  astral                      315
21  Twitch-Drops-Bot            283
22  puppeteer-ide-extension     226
23  puppeteer-report            152

Did you know that TypeScript is the most popular programming language based on number of references?