Python Extract

Open-source Python projects categorized as Extract

Top 14 Python Extract Projects

  • video-subtitle-extractor

    视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

  • camelot

    Camelot: PDF Table Extraction for Humans (by atlanhq)

  • Project mention: How do you parse tables in PDF with langchain? Especially, the context which is few lines above and below the table. | /r/LangChain | 2023-06-23
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • dlt

    data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

  • Project mention: Ask HN: Freelancer? Seeking freelancer? (December 2023) | news.ycombinator.com | 2023-12-03

    SEEKING FREELANCER | REMOTE | GERMANY

    dltHub is looking for a freelance help in the following repos:

    - https://github.com/dlt-hub/dlt

  • URLExtract

    URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.

  • open-mcr

    :pencil: Exam bubble sheet scorer. Created with OpenCV and Python.

  • icoextract

    Extract icons from Windows PE files (.exe/.dll)

  • pdf2doi

    A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • HDR-Multi-Tool

    A graphical user interface for parsing HDR10+ and Dolby Vision

  • Project mention: New to Downsizing, Have some basic questions | /r/handbrake | 2023-12-07
    Project mention: How can I scrape every .sensorpanel attachment from this thread? | /r/DataHoarder | 2023-12-05

    You can grab my grablinks.py Python3 script from here: https://github.com/the-real-tokai/grablinks

  • NewPipePlaylistExtractor

    Download your NewPipe created playlists as mp3, wav or other codec and listen to it offline.

  • MPKExtractor

    Simple extractor script for Diablo Immortal's .MPK files

  • Spooq

  • docxlatex

    A python library for extracting text from .docx files with support for inserted mathematical equations

  • Project mention: Copy word documents to clipboard | /r/learnpython | 2023-05-23

    I realize there may be some other ways to achieve what I'm trying to achieve (for example using docxlatex to get something similar). I'm just wondering if there's an easier way to access the equations of a word document in linear form.

  • AutomaticDemuxer

    Automatically Demux tracks from media-files

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Extract related posts

  • How can I scrape every .sensorpanel attachment from this thread?

    1 project | /r/DataHoarder | 5 Dec 2023
  • Ratarmount: Random Access Tar Mount

    1 project | news.ycombinator.com | 14 May 2023
  • Figured out how to combine Google Earth tiles into a single glTF, load it into Blender or any game engine like PlayCanvas

    2 projects | /r/computergraphics | 13 May 2023
  • How to UV unwrap a large object (1 million verts) using Smart UV Project? Would love to have it all unwrapped on one object if possible...

    2 projects | /r/blenderhelp | 26 Nov 2022
  • Ratarmount – Fast transparent access to archives through FUSE

    2 projects | news.ycombinator.com | 10 Mar 2022
  • Is there a way to accelerate extracting .tar contents?

    1 project | /r/linuxquestions | 29 Jun 2021
  • How to make thunar show exe files thumbnails? Gnome 40, Fedora 34

    1 project | /r/gnome | 17 May 2021
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 10 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source Extract projects in Python? This list will help you:

Project Stars
1 video-subtitle-extractor 4,889
2 camelot 3,553
3 dlt 1,758
4 URLExtract 236
5 open-mcr 147
6 icoextract 102
7 pdf2doi 84
8 HDR-Multi-Tool 71
9 grablinks 23
10 NewPipePlaylistExtractor 14
11 MPKExtractor 9
12 Spooq 8
13 docxlatex 7
14 AutomaticDemuxer 2

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com