Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression. Learn more →
Top 17 Python web-scraper Projects
-
For epubs, same for me. I use https://github.com/dipu-bd/lightnovel-crawler which allows you to make epub for many websites.
-
Monkey-DL (Anime Downloader)
Bulk download your favourite anime episodes from your favourite anime websites
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
-
web-scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
-
summarizer
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
-
facebook_page_scraper
Scrapes facebook's pages front end with no limitations & provides a feature to turn data into structured JSON or CSV
-
CobWeb-lnx
CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.
PyPi: pypi.org/project/CobWeb-lnx/
-
InfluxDB
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
-
tagalog-dictionary-scraper
Builds a Tagalog dictionary by collecting Tagalog words from tagalog.pinoydictionary.com
-
mexican-jobs-2020
Data ETL & Analysis on thousands of job listings from the official Mexican job board (2020 edition).
-
Project mention: Any ideas? I need a bot to grab comments from a reddit post and put them on github repository | /r/github | 2023-01-05
It's not what you're directly looking for but as an example and starting point I'd check out https://github.com/PhantomInsights/reddit-bots
-
tweet-transcriber
A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.
-
-
Python-Web-Scraper
An adaptive Python Web Scraper App to catch the best deals by scraping and parsing data from select E-Commerce sites.
Project mention: Python Web Scraper/Crawler for E-Commerce sites. Currently supports only a few websites but im looking to expand that list. Tips/criticism are welcomed. This is the first project for my student CV (0 working experience) so I'd like it to be as polished as possible. | /r/programming | 2023-03-01 -
Abosar
অবসর 📚 A Collection Of Short Bengali Stories Web Scraped From Various Bengali eMagazines And eNewspapers.
-
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python web-scraper related posts
- Multiparadigmatic Web Scraping Tool!
- ISSTH left me disappointed
- a discord server and bot to fetch epub chapters from novels?
- Python Web Scraper/Crawler for E-Commerce sites. Currently supports only a few websites but im looking to expand that list. Tips/criticism are welcomed. This is the first project for my student CV (0 working experience) so I'd like it to be as polished as possible.
- Wat is jullie ervaring met e-readers?
- Does the kindle have a search function? A working one? I’ve seen videos but those are like years old.
- ¿Cómo se llamaba el alma de código que resumía noticias?¿Sigue vivo?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 8 Jun 2023
Index
What are some of the best open-source web-scraper projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | lightnovel-crawler | 959 |
2 | Monkey-DL (Anime Downloader) | 744 |
3 | onlyfans-dl | 646 |
4 | web-scraping | 468 |
5 | summarizer | 259 |
6 | facebook_page_scraper | 117 |
7 | CobWeb-lnx | 31 |
8 | tagalog-dictionary-scraper | 21 |
9 | mexican-jobs-2020 | 21 |
10 | reddit-bots | 21 |
11 | tweet-transcriber | 19 |
12 | git-pull | 15 |
13 | Python-Web-Scraper | 10 |
14 | Abosar | 5 |
15 | varieteebot | 3 |
16 | nanoscrape | 0 |
17 | iw-scraper | 0 |