TypeScript Puppeteer

Open-source TypeScript projects categorized as Puppeteer

Top 23 TypeScript Puppeteer Projects

  • crawlee

    Crawlee—A web scraping and browser automation library for Node.js that helps you build reliable crawlers. Fast.

    Project mention: Spidergram is a collection of tools my company Autogram has built or enabled over the past several years to support our work to automate content inventories for large websites: it's part web crawler, part domain model, and part mad science. We released the first public beta today. | reddit.com/r/webscraping | 2022-12-02

    Apify's Crawlee project, with a specific focus on Playwright. We decided to focus on it for now because the majority of our projects involve some kind of cross-browser evaluation for clients, and Playwright's ability to swap in Safari and Firefox rendering engines was a huge help.

  • chrome

    The browserless Chrome service in Docker. Run on our cloud, or bring your own.

    Project mention: Angular docker base image | reddit.com/r/angular | 2022-12-20

    I had a look at this one: https://github.com/browserless/chrome ... but it is not suitable for builds, e.g. set to production mode, user permissions and so on.

  • qawolf

    🐺 Create browser tests 10x faster

  • chrome-aws-lambda

    Chromium Binary for AWS Lambda and Google Cloud Functions

    Project mention: Building a PDF Generator using AWS Lambda | dev.to | 2022-10-08

    git clone --depth=1 https://github.com/alixaxel/chrome-aws-lambda.git && cd chrome-aws-lambda && make chrome_aws_lambda.zip

  • puppeteer-cluster

    Puppeteer Pool, run a cluster of instances in parallel

    Project mention: Looking for something and I'm not sure what it would be called..... | reddit.com/r/selfhosted | 2022-11-13

    You could set up a service with something like playwright, puppeteer, puppeteer-cluster, or browserless to access the service internally and serve screenshots of it to the outside user. You'd probably have to set up some kind of web service with the appropriate routes.

  • pwa-asset-generator

    Automates PWA asset generation and image declaration. Automatically generates icon and splash screen images, favicons and mstile images. Updates manifest.json and index.html files with the generated images according to Web App Manifest specs and Apple Human Interface guidelines.

    Project mention: I would like help with PWA | reddit.com/r/programacao | 2023-01-04

  • md-to-pdf

    Hackable CLI tool for converting Markdown files to PDF using Node.js and headless Chrome.

    Project mention: Convert Markdown format files to PDF files (mermaid/emoji/toc compatible) | dev.to | 2022-09-01

    I could have used md-to-pdf, but it seemed like it would take a little time, so I implemented it as quickly as possible, like a hackathon. I wanted to, so I decided to use a library called md-to-pdf-ng.

  • unlighthouse

    Unlighthouse scans your entire site using Google Lighthouse, with a modern UI, minimal config and smart sampling.

    Project mention: How to improve the PageSpeed score of your Nuxt.js website in 6 steps | dev.to | 2022-07-18

    After deploying all these changes on our website, the PageSpeed score jumped from 23 to 97. To ensure that the numbers won't drop over time, I'd recommend setting up the https://unlighthouse.dev checks for your CI/CD.

  • adblocker

    Efficient embeddable adblocker library

    Project mention: Mozilla reaffirms that Firefox will continue to support current content blockers | news.ycombinator.com | 2022-09-24

    No well-known content blocker "process about 90,000 regexps" to find out whether a resource needs to be blocked or not, that's just not how it works internally.

    Last time I ran benchmarks of all well-known content blockers using Ghostery's benchmark tool[1], all of them could process a network request under 20µs on average.

    Some do have performance concerns, but it has nothing to do with network filtering, it has to do with other stuff they do beyond network filtering (for example see [2]) and declarativeNetRequest does not help there, so they will still suffer these performance issues under MV3.

    ---

    [1] https://github.com/ghostery/adblocker/tree/master/packages/a...

    [2] https://www.extremetech.com/computing/182428-ironic-iframes-...

  • secret-agent

    The web scraper that's nearly impossible to block - now called @ulixee/hero

    Project mention: Way to scrape any dynamic data from any site / Server for scaping | reddit.com/r/webscraping | 2022-04-17

    The automated browser under the hood is SecretAgent, which masks a lot of bot signals and bypasses a large number of anti-bot systems (also pay attention to your proxy quality).

  • linvo-scraper

    LinkedIn Automation Bot with every possible scraping! Valid for 2022, used by Linvo.io

    Project mention: Linvo-Scraper: LinkedIn Automation Bot | reddit.com/r/hypeurls | 2022-10-08

  • replay

    Library that provides an API to replay and stringify recordings created using Chrome DevTools Recorder (by puppeteer)

    Project mention: Chrome Recorder: Record, Replay and Measure User Flows | dev.to | 2022-04-28

    You can start a recording, execute the steps you’d like to record in the app (such as typing or clicking), and export the recording as JSON file, Puppeteer script or @puppeteer/replay script.

  • tinking

    🧶 Extract data from any website without code, just clicks.

  • deno-puppeteer

    A port of puppeteer running on Deno

    Project mention: Unity ships Node-IPC vulnerability | reddit.com/r/programming | 2022-03-21

    In an ideal world, yes, but in reality Deno's permission model is quite hard to use, so many libraries require disabling a lot of it, sometimes everything, like puppeteer. Another example is the library I maintain: it requires --allow-net since you can't whitelist a domain and all its subdomains, just a domain.

  • Recorder

    A browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨ (by DeploySentinel)

    Project mention: I built an alternative for the deprecated & beloved Cypress Studio - generate Cypress scripts from your browser interactions right inside of Cypress | reddit.com/r/vuejs | 2022-07-26

  • root-cause

    🔍 Root Cause is a tool for troubleshooting Puppeteer and Playwright tests. 🔎

  • Twitch-Drops-Bot

    A Node.js bot that will automatically watch Twitch streams and claim drop rewards.

    Project mention: I want to play Overwatch! Not watch someone else play it. | reddit.com/r/Overwatch | 2022-10-25

  • fingerprint-suite

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    Project mention: Crawlee · Build reliable crawlers. Fast. | Crawlee | reddit.com/r/node | 2022-08-23

    The fingerprinting system Crawlee uses is published here https://github.com/apify/fingerprint-suite

  • dark-knowledge

    😈📚 A curated library of research papers and presentations for counter-detection and web privacy enthusiasts.

    Project mention: Choose your browser carefully| by Unix Sheikh | reddit.com/r/degoogle | 2023-01-18

    Browsers are complicated and you won't find a single analysis covering all aspects. For a security analysis Madaidan's blog is a good starting point. For a privacy analysis you need to learn the common tracking methods and which solutions or mitigations are available in which browser (and if they are properly implemented). You could start by learning about the different forms of tracking through state (cookies, cache, storage, ...), which is still one of the most used tracking methods. Of course you also need to check the easy things like telemetry. Then there is fingerprinting which is a huge topic on its own. This is where it's even more fun. You need to start reading research papers, not just one but many and you need to check mitigations used in browsers and their statistical implications (data about this is unfortunately very rare).

  • mugshot

    Framework independent visual testing library

  • puppeteer-report

    Convert HTML to PDF by Puppeteer with support of adding a custom header, footer, and page number

    Project mention: Jikji - Layout library for building effortless reports in React | dev.to | 2022-05-14

    puppeteer-report provides a simple solution for having a custom header, footer, and page numbers. Still, I got a bunch of requests on GitHub for new features and had to reject most of them because puppeteer has a lot of limitations in the customization of layout and printing.

  • puppeteer-ide-extension

    Standalone puppeteer playground in browser's developer tools.

    Project mention: Puppeteer IDE in browser's developer tools | reddit.com/r/opensource | 2022-07-20

  • dakka

    Dakka - generate automation tests for Cypress, Playwright and Puppeteer

    Project mention: Would you be interested in POM generator for Selenium? | news.ycombinator.com | 2022-06-11

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2023-01-18.
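
The deno-puppeteer mention in the list above says that a network allow-list entry covers a single domain but not its subdomains. As a rough illustration of that distinction only, and not Deno's actual permission code, the TypeScript sketch below compares exact-host matching with a suffix rule that also accepts subdomains. The function names `matchesExactHost` and `matchesHostOrSubdomain` and the sample hosts are hypothetical and exist only for this example.

```typescript
// Hypothetical helpers illustrating the distinction; not Deno's implementation.

// Exact matching: the requested host must equal an allow-list entry.
function matchesExactHost(allowed: string[], host: string): boolean {
  const h = host.toLowerCase();
  return allowed.some((entry) => entry.toLowerCase() === h);
}

// Suffix matching: also accept any subdomain of an allow-list entry.
function matchesHostOrSubdomain(allowed: string[], host: string): boolean {
  const h = host.toLowerCase();
  return allowed.some((entry) => {
    const e = entry.toLowerCase();
    // The leading dot keeps "notexample.com" from matching "example.com".
    return h === e || h.endsWith("." + e);
  });
}

const allowed = ["example.com"];
console.log(matchesExactHost(allowed, "api.example.com")); // false
console.log(matchesHostOrSubdomain(allowed, "api.example.com")); // true
console.log(matchesHostOrSubdomain(allowed, "notexample.com")); // false
```

Under the exact rule, a library that talks to many subdomains would need every host listed individually, which is the situation the commenter describes as leading to a blanket --allow-net.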

Index

What are some of the best open-source Puppeteer projects in TypeScript? This list will help you:

Rank Project Stars
1 crawlee 7,563
2 chrome 5,187
3 qawolf 3,198
4 chrome-aws-lambda 2,928
5 puppeteer-cluster 2,691
6 pwa-asset-generator 2,276
7 md-to-pdf 781
8 unlighthouse 668
9 adblocker 599
10 secret-agent 562
11 linvo-scraper 527
12 replay 453
13 tinking 383
14 deno-puppeteer 363
15 Recorder 313
16 root-cause 262
17 Twitch-Drops-Bot 235
18 fingerprint-suite 217
19 dark-knowledge 213
20 mugshot 127
21 puppeteer-report 112
22 puppeteer-ide-extension 111
23 dakka 81