With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js. Learn more →
Top 23 TypeScript Puppeteer Projects
-
crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
-
browserless
Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.
-
SurveyJS
Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
-
pwa-asset-generator
Automates PWA asset generation and image declaration. Automatically generates icon and splash screen images, favicons and mstile images. Updates manifest.json and index.html files with the generated images according to Web App Manifest specs and Apple Human Interface guidelines.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Spearmint
Testing, simplified. || An inclusive, accessibility-first GUI for generating clean, semantic Javascript tests in only a few clicks of a button. (by open-source-labs)
-
x-crawl
x-crawl is a flexible Node.js multifunctional crawler library. Flexible usage and numerous functions can help you quickly, safely, and stably crawl pages, interfaces, and files.
-
BotD
Bot detection library that runs in the browser. Detects automation tools and frameworks. No server required, runs 100% on the client. MIT license, no usage restrictions.
-
replay
Library that provides an API to replay and stringify recordings created using Chrome DevTools Recorder (by puppeteer)
-
Recorder
A browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨ (by DeploySentinel)
-
puppeteer-report
Convert HTML to PDF by Puppeteer with support of adding a custom header, footer, and page number
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
In this guide, we'll be extracting information from Amazon product pages using the power of TypeScript in combination with the Cheerio and Crawlee libraries. We'll explore how to retrieve and extract detailed product data such as titles, prices, image URLs, and more from Amazon's vast marketplace. We'll also discuss handling potential blocking issues that may arise during the scraping process.
Project mention: How and why we ripped our Open Source product apart for a full rebuild | dev.to | 2024-02-28The core product is managed, cloud hosted browsers. We run thousands at a time using AWS and DigitalOcean, for people to use with Puppeteer and Playwright scripts. Our container is also available to self deploy under an open-source license.
Project mention: How To Generate Icons for a Progressive Web App from SVG File With a Single Command | dev.to | 2023-07-30To generate icons, we use pwa-asset-generator. The first command generates a favicon icon with a transparent background, the second one creates all the necessary icons for a progressive web app, and the third one creates images for splash screens. The last command is optional, in case you have an icon for dark mode.
If the crates.io team wanted to go further they could employ some invasive methods to detect bots (usually it involves a JS library that does fingerprinting on the browser - something like BotD), but I'm not advocating for it. I don't think crates.io should collect more data, they should just perform better statistics on the data they already have.
TypeScript Puppeteer related posts
- How and why we ripped our Open Source product apart for a full rebuild
- Eleve o nível de suas Aplicações Javascript com Load Test
- Self-hosted browserless.io alternative ?
- Unlighthouse - scan you entire website
- A PageSpeed insight report, but for every page in your domain
- Unlighthouse: Like Lighthouse, but it scans every single page
- Unlighthouse: Like Lighthouse, but it scans every single page
-
A note from our sponsor - SurveyJS
surveyjs.io | 25 Apr 2024
Index
What are some of the best open-source Puppeteer projects in TypeScript? This list will help you:
Project | Stars | |
---|---|---|
1 | crawlee | 12,044 |
2 | browserless | 7,842 |
3 | jest-puppeteer | 3,519 |
4 | qawolf | 3,273 |
5 | chrome-aws-lambda | 3,136 |
6 | puppeteer-cluster | 3,083 |
7 | pwa-asset-generator | 2,628 |
8 | Spearmint | 1,283 |
9 | md-to-pdf | 1,067 |
10 | x-crawl | 1,016 |
11 | BotD | 903 |
12 | replay | 871 |
13 | adblocker | 728 |
14 | fingerprint-suite | 696 |
15 | secret-agent | 633 |
16 | linvo-scraper | 589 |
17 | deno-puppeteer | 439 |
18 | Recorder | 399 |
19 | Twitch-Drops-Bot | 279 |
20 | puppeteer-ide-extension | 184 |
21 | puppeteer-report | 137 |
22 | mugshot | 135 |
23 | phishim | 125 |
Sponsored