Python Extractor

Open-source Python projects categorized as Extractor

Top 9 Python Extractor Projects

  1. news-please

    news-please - an integrated web crawler and information extractor for news that just works

  2. wiktextract

    Wiktionary dump file parser and multilingual data extractor

    Project mention: Show HN: I made a faster, mobile-friendly interface for Wiktionary | news.ycombinator.com | 2025-04-12

  3. TorCrawl.py

    Crawl and extract (regular or onion) webpages through TOR network

  4. URLExtract

    URLExtract is a Python class for collecting (extracting) URLs from given text based on locating TLDs.

  5. PyVideoFramesExtractor

    Extract frames from videos in Python using OpenCV.

  6. AI-image-tag-extractor

    A tool to help you get image info.

  7. snap-lens-tool

    A Swiss Army Knife for Snapchat Lenses.

  8. MPKExtractor

    Simple extractor script for Diablo Immortal's .MPK files

  9. unbumblebee

    Python script to extract the C&C configuration from an active Bumblebee process through PE-Sieve

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Extractor related posts

  • Show HN: I made a faster, mobile-friendly interface for Wiktionary

    2 projects | news.ycombinator.com | 12 Apr 2025
  • Wiktionary dump file parser and multilingual data extractor

    1 project | news.ycombinator.com | 2 Apr 2023
  • Dynamically generating minimal pair decks for Anki

    1 project | /r/languagelearning | 18 Jun 2022
  • What are some of the best digital free dictionaries available online (even for commercial use)?

    1 project | /r/languagelearning | 2 Jan 2022
  • Best Approach to importing a languages dictionary

    1 project | /r/AskProgramming | 8 Oct 2021
  • This is not perfect but it's a start

    1 project | /r/languagelearning | 22 Sep 2021
  • MikeMeliz/TorCrawl.py - Crawl and extract (regular or onion) webpages through TOR network

    1 project | /r/bag_o_news | 16 Jan 2021

Index

What are some of the best open-source Extractor projects in Python? This list will help you:

#  Project                  Stars
1  news-please              2,205
2  wiktextract                888
3  TorCrawl.py                380
4  URLExtract                 253
5  PyVideoFramesExtractor      40
6  AI-image-tag-extractor      40
7  snap-lens-tool              20
8  MPKExtractor                11
9  unbumblebee                  7
