secutils-web-scraper

The web scrapper component of Secutils.dev (by secutils-dev)

Secutils-web-scraper Alternatives

Similar projects and alternatives to secutils-web-scraper based on common topics and language

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better secutils-web-scraper alternative or higher similarity.

secutils-web-scraper reviews and mentions

Posts with mentions or reviews of secutils-web-scraper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-16.
  • How to track anything on the internet or use Playwright for fun and profit
    5 projects | dev.to | 16 Jan 2024
    To begin, all functionality related to browser automation and web scraping lives in a dedicated service — Web Scraper. The primary rationale is that dealing with browsers and arbitrary user scripts is tricky from a security standpoint, and it's always a good idea to isolate such functionality as much as possible. You can read more about the security aspects of web scraping in the "Running web scraping service securely" post.
  • Running web scraping service securely
    4 projects | dev.to | 19 Oct 2023
    When it comes to web page resource scraping, Secutils.dev relies on a separate component - secutils-dev/secutils-web-scraper. I've built it on top of Playwright since I need to handle both resources that are statically defined in the HTML and those that are loaded dynamically. Leveraging Playwright, backed by a real browser, instead of parsing the static HTML opens up a ton of opportunities to turn a simple web resource scraper into a much more intelligent tool capable of handling all sorts of use cases: recording and replaying HARs, imitating user activity, and more.
  • Detecting changes in JavaScript and CSS isn't an easy task, Part 1
    1 project | dev.to | 7 Sep 2023
    While both Puppeteer and Playwright have their own advantages and disadvantages, I have chosen Playwright for Secutils.dev. Playwright not only allows us to access all browser APIs within the web page context to easily detect and extract inline resources, but also enables us to intercept all external dynamically loaded web page resources. Here's an example of the code (full code can be found here):
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 5 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic secutils-web-scraper repo stats
3
1
8.4
4 months ago

secutils-dev/secutils-web-scraper is an open source project licensed under GNU Affero General Public License v3.0 which is an OSI approved license.

The primary programming language of secutils-web-scraper is TypeScript.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com