cloudproxy
Hide your scraper's IP behind the cloud. Provision proxy servers across different cloud providers to improve your scraping success.
Even though I only do it for hobby projects, crawling pages is becoming increasingly difficult unless you are a big player like Google or Microsoft with a whitelisted IP range.
I've had some success in scraping lately with a similar project called FlareSolverr(1).
Its purpose is to get you access to sites that won't let you crawl unless you are using a real browser (e.g. Amazon, Instagram). It doesn't hide your IP, but it uses Puppeteer with stealth mode to get you access to otherwise restricted URLs.
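For anyone who wants to try it, here is a rough sketch of how a request through FlareSolverr can look from Python. It assumes an instance is already running locally on its default port (8191) and that the `/v1` endpoint accepts a `request.get` command, as described in the project's README. The helper names (`build_payload`, `extract_html`, `fetch_via_flaresolverr`) and the timeout values are my own choices for illustration, not part of FlareSolverr.

```python
import json
import urllib.request

# Assumption: FlareSolverr is running locally on its default port.
FLARESOLVERR_ENDPOINT = "http://localhost:8191/v1"


def build_payload(url, max_timeout_ms=60000):
    """Build the JSON body for a FlareSolverr GET request (hypothetical helper)."""
    return {"cmd": "request.get", "url": url, "maxTimeout": max_timeout_ms}


def extract_html(result):
    """Return the page HTML from a decoded FlareSolverr reply (hypothetical helper)."""
    if result.get("status") != "ok":
        raise RuntimeError(f"FlareSolverr error: {result.get('message')}")
    # The rendered page source is expected under solution.response.
    return result["solution"]["response"]


def fetch_via_flaresolverr(url, endpoint=FLARESOLVERR_ENDPOINT):
    """POST the request to FlareSolverr and return the HTML it fetched."""
    body = json.dumps(build_payload(url)).encode("utf-8")
    request = urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    # Allow more time than maxTimeout so the browser can finish its work.
    with urllib.request.urlopen(request, timeout=90) as response:
        return extract_html(json.load(response))


if __name__ == "__main__":
    html = fetch_via_flaresolverr("https://example.com/")
    print(html[:200])
```

The request still leaves from your own IP, so this only helps with browser checks, not with IP-based blocking.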