Top 9 Python Extractor Projects
news-please
news-please - an integrated web crawler and information extractor for news that just works
Project mention: Show HN: I made a faster, mobile-friendly interface for Wiktionary | news.ycombinator.com | 2025-04-12
URLExtract
URLExtract is a Python class for collecting (extracting) URLs from a given text by locating TLDs.
unbumblebee
Python script to extract the C&C configuration from an active Bumblebee process through PE-Sieve
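The URLExtract entry above describes finding URLs by locating a TLD in the text. As a rough illustration of that idea, here is a minimal sketch in plain Python. It is not URLExtract's actual code or API: the `find_urls` function, the tiny `KNOWN_TLDS` set, and the `STOP_CHARS` boundary characters are all assumptions made for this example, and a real tool would work from the full IANA TLD list.

```python
import re

# Hypothetical, deliberately tiny TLD set for illustration only.
KNOWN_TLDS = {"com", "org", "net", "io"}

# Assumed boundary characters: expansion around a TLD stops at any of these.
STOP_CHARS = set(" \t\n\r<>\"'()[]{},;")


def find_urls(text, tlds=KNOWN_TLDS):
    """Return URL-like substrings found by locating a known TLD
    and expanding left and right until a boundary character."""
    # Match ".tld" only when it is not followed by more label characters,
    # so ".com" does not match inside ".community".
    alternatives = "|".join(sorted(map(re.escape, tlds), key=len, reverse=True))
    pattern = re.compile(r"\.(" + alternatives + r")(?![A-Za-z0-9-])", re.IGNORECASE)

    urls = []
    pos = 0
    while True:
        match = pattern.search(text, pos)
        if match is None:
            break
        # Expand to the left of the TLD until a boundary character.
        start = match.start()
        while start > 0 and text[start - 1] not in STOP_CHARS:
            start -= 1
        # Expand to the right of the TLD until a boundary character.
        end = match.end()
        while end < len(text) and text[end] not in STOP_CHARS:
            end += 1
        # Keep the candidate only if something precedes the dot (a host label).
        if start < match.start():
            # Drop trailing sentence punctuation such as a final period.
            urls.append(text[start:end].rstrip(".!?:"))
        pos = end
    return urls


sample = "Docs live at https://example.com/docs, mirror at example.org. Not a URL: file.txt"
print(find_urls(sample))
```

Anchoring on the TLD rather than on a scheme such as `https://` lets the sketch pick up bare hostnames like `example.org`, at the cost of needing a TLD list and some boundary heuristics.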
Python Extractor discussion
Python Extractor related posts
- Show HN: I made a faster, mobile-friendly interface for Wiktionary
- Wiktionary dump file parser and multilingual data extractor
- Dynamically generating minimal pair decks for Anki
- What are some of the best digital free dictionaries available online (even for commercial use)?
- Best Approach to importing a languages dictionary
- This is not perfect but it's a start
- MikeMeliz/TorCrawl.py - Crawl and extract (regular or onion) webpages through TOR network
Index
What are some of the best open-source Extractor projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | news-please | 2,205 |
2 | wiktextract | 888 |
3 | TorCrawl.py | 380 |
4 | URLExtract | 253 |
5 | PyVideoFramesExtractor | 40 |
6 | AI-image-tag-extractor | 40 |
7 | snap-lens-tool | 20 |
8 | MPKExtractor | 11 |
9 | unbumblebee | 7 |