parse5
HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant. (by inikulin)
x-ray
The next web scraper. See through the <html> noise. (by matthewmueller)
Metric | parse5 | x-ray
---|---|---
Mentions | 4 | 3
Stars | 3,649 | 5,873
Growth | - | -
Activity | 9.2 | 0.0
Last commit | 1 day ago | 28 days ago
Language | TypeScript | JavaScript
License | MIT License | MIT License
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
parse5
Posts with mentions or reviews of parse5.
We have used some of these posts to build our list of alternatives and similar projects. The most recent one was posted on 2022-12-21.
- error of installing icon library
- casperjs, phantomjs, what is not going to be abandonware?
  A relatively stable option would probably be to just use puppeteer directly to spawn a headless chrome, and extract the html that way. If you want to parse the html, I recommend feeding that into parse5.
- Getting Started with Deno
  After some googling, I landed on parse5 which appeared to have wide usage and offered a simple, low-level tree API at its core.
- How does session replay work Part1: Serialization
  We do not use existing open-source solutions such as parse5 for two reasons:
x-ray
Posts with mentions or reviews of x-ray.
We have used some of these posts to build our list of alternatives and similar projects. The most recent one was posted on 2023-03-22.
- The best Node.js web scrapers for your use case
  X-Ray is a Node.js library created for scraping the web. So, it is no surprise that its API is heavily focused on that task. Thus, it abstracts most of the complexity we have seen in Puppeteer and Axios from developers.
- Can anyone scrape the names and prices of all the products from this website?
  I used x-ray (Javascript/Nodejs) to get your data. This is all the code I needed:
- New To x-ray... actually new to Node all together
  You talking about this package? https://www.npmjs.com/package/x-ray
What are some alternatives?
When comparing parse5 and x-ray you can also consider the following projects:
JSONStream
excel-stream
nearley - 📜🔜🌲 Simple, fast, powerful parser toolkit for JavaScript.
strip-json-comments - Strip comments from JSON. Lets you use comments in your JSON files!
xml2js - XML to JavaScript object converter.
neat-csv - Fast CSV parser
URI.js - JavaScript URL mutation library
Chevrotain - Parser Building Toolkit for JavaScript
json-query - Retrieves values from JSON objects for data binding
PEG.js - Parser generator for JavaScript