mexican-government-report
py-pdf-parser
Our great sponsors
mexican-government-report | py-pdf-parser | |
---|---|---|
2 | 2 | |
481 | 335 | |
-0.4% | - | |
0.0 | 4.8 | |
over 4 years ago | 10 days ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mexican-government-report
- Extract text from PDF
-
[For Hire] Data Analysis, Bots, Web Scrapers & Automation Software
Mexican Government Report Text Analysis using spaCy, NumPy, pandas, Matplotlib, Seaborn and geopandas.
py-pdf-parser
-
Need free/low-cost software that allows me to view the tags in a PDF.
Maybe look at this?
-
Extract text from PDF
I'd recommend trying py-pdf-parser [0] - it allows you to fetch data from documents based on text "markers". E.g. you can easily find data, located to the right of "EMAL FROM:" text [0] - https://github.com/jstockwin/py-pdf-parser
What are some alternatives?
mexican-jobs-2020 - Data ETL & Analysis on thousands of job listings from the official Mexican job board (2020 edition).
layout-parser - A Unified Toolkit for Deep Learning Based Document Image Analysis
reddit-bots - A collection of Reddit bots that I use to enhance the subreddits I manage.
pdfplumber - Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
actions-bot - A tutorial explaining how to host and schedule a Discord webhook bot on GitHub Actions.
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Maya - Datetimes for Humans™
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
forhirehelper - A Python desktop application that makes the use of freelancing subreddits easier and faster.
pydantic - Data validation using Python type hints