Top 14 Python Extract Projects
-
video-subtitle-extractor
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. Built on a deep-learning subtitle extraction framework that covers subtitle region detection and subtitle content extraction.
-
URLExtract
URLExtract is a Python class for collecting (extracting) URLs from a given text by locating TLDs.
-
pdf2doi
A Python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a PDF file.
-
grablinks
A simple and streamlined Python script to extract and filter links from a remote HTML resource.
-
NewPipePlaylistExtractor
Download your NewPipe-created playlists as mp3, wav, or another codec and listen to them offline.
-
docxlatex
A Python library for extracting text from .docx files, with support for inserted mathematical equations.
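URLExtract is described above as finding URLs by locating TLDs. The idea can be sketched with the standard library alone. This is a minimal illustration and not URLExtract's actual implementation or API: `KNOWN_TLDS` is a tiny hypothetical set (a real tool would load the full IANA list), and `find_urls` is a name chosen for this example.

```python
import re

# Hypothetical, deliberately tiny TLD set for illustration only.
KNOWN_TLDS = {"com", "org", "net", "io", "dev"}

# A candidate token starts with an alphanumeric and contains URL-ish characters.
_TOKEN = re.compile(r"[A-Za-z0-9][A-Za-z0-9./:_~%#?=&+@-]*")
# Optional scheme, optional userinfo, then the host part (captured).
_HOST = re.compile(
    r"^(?:[a-z][a-z0-9+.-]*://)?(?:[^/@]+@)?([^/:?#]+)", re.IGNORECASE
)


def find_urls(text):
    """Return tokens whose host part ends in one of KNOWN_TLDS."""
    urls = []
    for match in _TOKEN.finditer(text):
        # Drop sentence punctuation that trails the token.
        token = match.group().rstrip(".,:;!?")
        host_match = _HOST.match(token)
        if not host_match:
            continue
        labels = host_match.group(1).lower().split(".")
        # Require at least "name.tld", no empty labels, and a known TLD.
        if len(labels) >= 2 and all(labels) and labels[-1] in KNOWN_TLDS:
            urls.append(token)
    return urls


if __name__ == "__main__":
    sample = "Docs live at https://example.com/docs, mirror on example.org."
    print(find_urls(sample))
```

Keying on the TLD rather than on a scheme lets the sketch pick up bare hosts such as `example.org`, while names like `file.txt` are skipped because `txt` is not in the set.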
Project mention: How do you parse tables in PDF with langchain? Especially, the context which is few lines above and below the table. | /r/LangChain | 2023-06-23
Project mention: Ask HN: Freelancer? Seeking freelancer? (December 2023) | news.ycombinator.com | 2023-12-03
SEEKING FREELANCER | REMOTE | GERMANY
dltHub is looking for freelance help in the following repos:
- https://github.com/dlt-hub/dlt
Project mention: How can I scrape every .sensorpanel attachment from this thread? | /r/DataHoarder | 2023-12-05
You can grab my grablinks.py Python3 script from here: https://github.com/the-real-tokai/grablinks
I realize there may be some other ways to achieve what I'm trying to achieve (for example using docxlatex to get something similar). I'm just wondering if there's an easier way to access the equations of a word document in linear form.
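The .sensorpanel question above comes down to extracting links from an HTML page and filtering them. A rough standard-library sketch of that idea follows. It is not grablinks' code or command-line interface: `LinkCollector`, `extract_links`, and the example URLs are hypothetical names chosen for illustration.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute URLs from the href attribute of <a> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url, pattern):
    """Return the links found in html that match the regex pattern."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    regex = re.compile(pattern)
    return [link for link in collector.links if regex.search(link)]


if __name__ == "__main__":
    page = (
        '<a href="/files/a.sensorpanel">A</a> '
        '<a href="https://example.com/b.png">B</a>'
    )
    print(extract_links(page, "https://forum.example/thread/1", r"\.sensorpanel$"))
```

In practice the HTML would first be fetched (for example with `urllib.request`), and the filtered URLs could then be handed to a downloader.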
Python Extract related posts
-
How can I scrape every .sensorpanel attachment from this thread?
-
Ratarmount: Random Access Tar Mount
-
Figured out how to combine Google Earth tiles into a single glTF, load it into Blender or any game engine like PlayCanvas
-
How to UV unwrap a large object (1 million verts) using Smart UV Project? Would love to have it all unwrapped on one object if possible...
-
Ratarmount – Fast transparent access to archives through FUSE
-
Is there a way to accelerate extracting .tar contents?
-
How to make thunar show exe files thumbnails? Gnome 40, Fedora 34
-
Index
What are some of the best open-source Extract projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | video-subtitle-extractor | 4,889 |
2 | camelot | 3,553 |
3 | dlt | 1,758 |
4 | URLExtract | 236 |
5 | open-mcr | 147 |
6 | icoextract | 102 |
7 | pdf2doi | 84 |
8 | HDR-Multi-Tool | 71 |
9 | grablinks | 23 |
10 | NewPipePlaylistExtractor | 14 |
11 | MPKExtractor | 9 |
12 | Spooq | 8 |
13 | docxlatex | 7 |
14 | AutomaticDemuxer | 2 |