spidergram
scrapyteer
spidergram | scrapyteer | |
---|---|---|
3 | 1 | |
101 | 18 | |
4.0% | - | |
8.0 | 4.0 | |
about 1 month ago | 2 months ago | |
TypeScript | TypeScript | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spidergram
- Data Scrape Tool for Existing Designs?
- Sitemap Generator/Crawl for an Authenticated website
- Spidergram is a collection of tools my company Autogram has built or enabled over the past several years to support our work to automate content inventories for large websites: it's part web crawler, part domain model, and part mad science. We released the first public beta today.
scrapyteer
-
Low-code Node.js web scraping tool
Hi guys, I've created an open-source low-code Node.js web scraping tool on top of the Puppeteer - https://github.com/miroshnikov/scrapyteer. It offers a small set of functions that are combined in pipelines to define a crawling workflow and a shape of output data. Maybe somebody will find it useful.
What are some alternatives?
crawlee - Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Philia - An easy to use imageboard scraper.
outlook-account-generator - Outlook Account Generator helps you create outlook accounts.
crawler - Library for Rapid (Web) Crawler and Scraper Development
squirm - This was the night of the crawling terror!
Dataflow kit - Extract structured data from web sites. Web sites scraping.
ayakashi - :zap: Ayakashi.io - The next generation web scraping framework
Crawly - Crawly, a high-level web crawling & scraping framework for Elixir.
awesome-web-scraping - List of libraries, tools and APIs for web scraping and data processing.
scraper - All In One API to easily scrape data from any website, without worrying about captchas and bot detection mecanisms.
blog-article-protection-scraping-headless-browser - Repository related to post in my blog. Visit it to more details.