crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation. (by apify)

Crawlee Alternatives

Similar projects and alternatives to crawlee

  1. Playwright

    539 crawlee VS Playwright

    Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. puppeteer

    412 crawlee VS puppeteer

    JavaScript API for Chrome and Firefox

  4. Protobuf

    Protocol Buffers - Google's data interchange format

  5. Scrapy

    198 crawlee VS Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  6. zstd

    Zstandard - Fast real-time compression algorithm

  7. httpx

    74 crawlee VS httpx

    A next generation HTTP client for Python. 🦋

  8. Selenium WebDriver

    A browser automation framework and ecosystem.

  9. SheetJS js-xlsx

    69 crawlee VS SheetJS js-xlsx

    📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

  10. jsdom

    66 crawlee VS jsdom

    A JavaScript implementation of various web standards, for use with Node.js

  11. cheerio

    63 crawlee VS cheerio

    The fast, flexible, and elegant library for parsing and manipulating HTML and XML.

  12. undetected-chromedriver

    Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)

  13. curl-impersonate

    curl-impersonate: A special build of curl that can impersonate Chrome & Firefox

  14. brotli

    34 crawlee VS brotli

    Brotli compression format

  15. crawlee-python

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

  16. scrapy-playwright

    🎭 Playwright integration for Scrapy

  17. firebase-signups-to-google-chat

    Be notified of new signups in your app directly in Google Chat

  18. estela

    10 crawlee VS estela

    estela, an elastic web scraping cluster 🕸

  19. botasaurus

    8 crawlee VS botasaurus

    The All in One Framework to Build Undefeatable Scrapers

  20. packages

    38 crawlee VS packages

    💎 Monorepository for Stylify packages. Stylify uses CSS-like selectors to generate Extremely optimized utility-first CSS dynamically based on what you write 💎. (by stylify)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better crawlee alternative or higher similarity.

crawlee discussion

Log in or Post with

crawlee reviews and mentions

Posts with mentions or reviews of crawlee. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2026-06-04.
  • hasdata-cli VS crawlee - a user suggested alternative
    2 projects | 4 Jun 2026
  • Five overlooked packages running my AI directory stack
    5 projects | dev.to | 22 May 2026
    I haven't shipped Crawlee yet, but it's been on my bookmarks list since I started building the itch.io ETL. My current approach is fetch + manual parsing, which works for known endpoints. Crawlee adds request queue persistence, rate limiting, and a cheerio integration for HTML extraction, all in TypeScript with native ESM support.
  • I Built 34 Web Scrapers — Here's What I Learned About Anti-Bot Detection
    1 project | dev.to | 18 Mar 2026
    After trying Scrapy, Playwright, raw Axios, and half a dozen other tools, I settled on Crawlee with Puppeteer. Here's why:
  • I Built 23 Free Web Scrapers on Apify — Here is What I Learned
    1 project | dev.to | 17 Feb 2026
    If you aren't using Crawlee, you're playing on hard mode. It’s the engine behind all my scrapers. It handles the boring stuff—request retries, proxy rotation, and session management—so I can focus on the parsing logic. The CheerioCrawler is my favorite for speed, while PlaywrightCrawler is my heavy hitter for dynamic sites.
  • How to Scrape Made-in-China.com for B2B Product Data
    2 projects | dev.to | 14 Feb 2026
    Here's a working scraper using Crawlee (CheerioCrawler) that extracts product data from search results:
  • How to Bypass reCAPTCHA and Turnstile in Crawlee with CapSolver
    1 project | dev.to | 24 Dec 2025
    Crawlee is a powerful, open-source web scraping and browser automation library for Node.js. It's built to create reliable, production-ready crawlers that can mimic human behavior and evade basic bot detection.
  • How I Block All 26M of Your Curl Requests
    6 projects | news.ycombinator.com | 2 Oct 2025
    What I have seen it is hard to tell what "serious scrapers" use. They use many things. Some use this, some not. This is what I have learned reading webscraping on reddit. Nobody speaks things like that out loud.

    There are many tools, see links below

    Personally I think that running selenium can be a bottle neck, as it does not play nice, sometimes processes break, even system sometimes requires restart because of things blocked, can be memory hog, etc. etc. That is my experience.

    To be able to scale I think you have to have your own implementation. Serious scrapers complain about people using selenium, or derivatives as noobs, who will come back asking why page X does not work in scraping mechanisms.

    https://github.com/lexiforest/curl_cffi

    https://github.com/encode/httpx

    https://github.com/scrapy/scrapy

    https://github.com/apify/crawlee

  • Scraperr – A Self Hosted Webscraper
    6 projects | news.ycombinator.com | 11 May 2025
    If you're a fan of Playwright check out Crawlee [0]. I've used it for a few small projects and it's been faster for me to get what I've needed done.

    [0] https://crawlee.dev/

  • How to scrape TikTok using Python
    5 projects | dev.to | 30 Apr 2025
    uvx crawlee['cli'] create tiktok-crawlee --crawler-type playwright --http-client httpx --package-manager uv --apify --start-url 'https://crawlee.dev'
  • Inside implementing SuperScraper with Crawlee.
    3 projects | dev.to | 5 Mar 2025
    View on GitHub
  • A note from our sponsor - SaaSHub
    www.saashub.com | 13 Jun 2026
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic crawlee repo stats
54
23,755
9.5
1 day ago

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com