Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR. Learn more →
Top 23 TypeScript Scraper Projects
-
firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Project mention: Show HN: Get structured website data with just a prompt | news.ycombinator.com | 2025-01-20- Also, most of our work including /extract is open-source. Check it out here at https://github.com/mendableai/firecrawl
That's all for now! Let us know any feedback on /extract.
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
Scraping the Academy Award winners listed on Wikipedia with cheerio and saving them to a CSV file.
-
crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
View on GitHub
-
maxun
Open-source no-code web data extraction platform. Turn websites to APIs & spreadsheets with no-code robots in minutes.
Project mention: Maxun: Open-Source No-Code Web Data Extraction Platform | news.ycombinator.com | 2024-11-08 -
Project mention: llm-scraper VS parsera - a user suggested alternative | libhunt.com/r/llm-scraper | 2024-10-16
-
api.consumet.org
A Modern Search Engine API for Anime, Movies/TVShows, Books, Light Novels, Manga, etc.
-
将网站转化为Epub
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
DevDocs
Completely free, private, UI based Tech Documentation MCP server. Designed for coders and software developers in mind. Easily integrate into Cursor, Windsurf, Cline, Roo Code, Claude Desktop App (by cyberagiinc)
Project mention: Show HN: We made an MCP Server so that Cursor can build anything from API Docs | news.ycombinator.com | 2025-03-24Looks cool, the only one similar I've seen so far that is similar is: https://github.com/cyberagiinc/DevDocs
But every-time I've tried to run DevDocs, I've had issues running it. Either the scraper or the MCP server fails to run.
-
-
scraper
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom. (by get-set-fetch)
-
-
freenom-auto-renew-domains
A scraper built with puppeteer that auto renew free domains on Freenom and send discord message using bot
-
-
Project mention: Mkfd – RSS feed builder API created with Bun and Hono | news.ycombinator.com | 2024-11-17
-
This is the second version of this bot. The first approach was webscraper-bot, which I built because I needed to be notified about new rental apartments quickly (more about that in this post). Some people started discovering the bot, and after a few months, I had around 100 users, but there was one big problem. Over 90% of the users didn't manage to create a single scraping bot because it required a query selector to be inserted. So how I interpreted it was:
-
-
passport-appointment-bot
An automated bot designed to seamlessly book appointments for the renewal or creation of Swedish passports or national ID cards.
-
-
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
TypeScript Scraper discussion
TypeScript Scraper related posts
-
Show HN: Get structured website data with just a prompt
-
Show HN: Llms.txt Generator – Turn websites into a text file to feed to any LLM
-
Maxun: Open-Source No-Code Web Data Extraction Platform
-
Maxun: Open-Source No-Code Web Data Extraction Platform
-
Maxun: Open Source No-Code Web Data Extraction Platform⚡️
-
Firecrawl: Turn entire websites into LLM-ready Markdown or structured data
-
Overcoming Common Web Scraping Challenges with Firecrawl, an open-source AI tool
-
A note from our sponsor - CodeRabbit
coderabbit.ai | 25 Mar 2025
Index
What are some of the best open-source Scraper projects in TypeScript? This list will help you:
# | Project | Stars |
---|---|---|
1 | firecrawl | 31,797 |
2 | cheerio | 29,221 |
3 | crawlee | 17,200 |
4 | maxun | 9,604 |
5 | llm-scraper | 4,635 |
6 | api.consumet.org | 1,330 |
7 | epublifier | 774 |
8 | linvo-scraper | 612 |
9 | HLTV | 419 |
10 | DevDocs | 350 |
11 | mwoffliner | 326 |
12 | scraper | 110 |
13 | extension | 81 |
14 | freenom-auto-renew-domains | 51 |
15 | vercel-metafy | 49 |
16 | mkfd | 63 |
17 | webscraper-bot | 29 |
18 | Philia | 25 |
19 | passport-appointment-bot | 24 |
20 | botasaurus-starter | 23 |
21 | scrapyteer | 19 |
22 | forward-proxy-manager | 12 |
23 | wallace-apple-dictionary | 11 |