puppet-scraper vs browserless

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

puppet-scraper		browserless
	Project
1	Mentions	21
29	Stars	7,893
-	Growth	8.1%
10.0	Activity	9.8
about 2 years ago	Latest Commit	5 days ago
TypeScript	Language	TypeScript
MIT License	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

puppet-scraper

Posts with mentions or reviews of puppet-scraper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-10-14.

Web Search and Scrape
3 projects | /r/javahelp | 14 Oct 2022

I'd look into scraping with JavaScript. Something like this could meet your needs.

browserless

Posts with mentions or reviews of browserless. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-06.

How and why we ripped our Open Source product apart for a full rebuild
1 project | dev.to | 28 Feb 2024

The core product is managed, cloud hosted browsers. We run thousands at a time using AWS and DigitalOcean, for people to use with Puppeteer and Playwright scripts. Our container is also available to self deploy under an open-source license.
Self-hosted browserless.io alternative ?
1 project | /r/webscraping | 18 May 2023

You should search for "Puppeteer as a service", there are some projects on github that you could deploy such as https://github.com/browserless/chrome
Remote Server Compromised
7 projects | /r/selfhosted | 6 Apr 2023

So I recently installed ChangeDetectioIO on my server, it requires either selenium/standalone-chrome-debug:3.141.59 or browserless/chrome. I installed it with Selenium in a docker container since I noticed that it was running better than the browserless/chrome service.
Angular docker base image
1 project | /r/angular | 20 Dec 2022

I had a look to this one: https://github.com/browserless/chrome ... but it is not suitable for builds, e.g. set to production mode, user permissions and so on.
browserless chrome (Web browser automation built for everyone)
1 project | /r/selfhosted | 10 Sep 2022
Ask HN: What are the best tools for web scraping in 2022?
33 projects | news.ycombinator.com | 10 Aug 2022
Using changedetection.io (installed via pip, not docker). How do I set up "WebDriver Chrome/Javascript"
1 project | /r/selfhosted | 29 Jul 2022

git clone https://github.com/browserless/chrome /opt/browserless
How to automate PDF generation of dashboards/web pages with open-source web automation
1 project | /r/opensource | 2 May 2022
Starring your repo does not give you permission to spam me
1 project | /r/patient_hackernews | 22 Mar 2022

1 project | /r/hackernews | 22 Mar 2022

What are some alternatives?

When comparing puppet-scraper and browserless you can also consider the following projects:

rendertron - A Headless Chrome rendering solution

Dompdf - HTML to PDF converter for PHP

crawlee - Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

PHP-Proxy - Proxy Application built on php-proxy library ready to be installed on your server

Twitch-Drops-Bot - A Node.js bot that will automatically watch Twitch streams and claim drop rewards.

browsershot - Convert HTML to an image, PDF or string

selenoid - Selenium Hub successor running browsers within containers. Scalable, immutable, self hosted Selenium-Grid on any platform with single binary.

FPDI - FPDI is a collection of PHP classes facilitating developers to read pages from existing PDF documents and use them as templates in FPDF.

mPDF - PHP library generating PDF files from UTF-8 encoded HTML

pagedjs - Display paginated content in the browser and generate print books using web technology

Custom-Scenes - Please go to https://github.com/Notexe/h3-custom-scenes instead. Hitman 3 custom scene experimentation using ResourceTool + QuickEntity + simple-mod-framework + RPKG Tool

aws-lambda-layer-node-puppeteer-headless-chromium