|2 days ago||3 days ago|
|BSD 3-clause "New" or "Revised" License||MIT License|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
QuestPDF 2021.10 - a new version of the open-source, MIT-licensed, C# library for generating PDF documents with fluent API, now with extended text capabilities. Please help me make it popular :)
8 projects | reddit.com/r/csharp | 6 Oct 2021
I’d recommend Weasyprint (.net core wrapper) instead of wkhtmltopdf. It supports CSS Paged Media which is pretty much required for everything but the simplest of HTML2PDF conversions.
Is there a way to publish a PDF report in a Jenkins job?
1 project | reddit.com/r/jenkinsci | 16 Sep 2021
you can convert html to pdf using https://github.com/Kozea/WeasyPrint
Beautiful PDFs from HTML
13 projects | news.ycombinator.com | 4 Apr 2021
Yeah, in the Python world there's WeasyPrint for PDF out in the wild as well. It's quite slick, but it's a harder sell because of Python, which corporate types seem to think is bad hacker central.
WeasyPrint - The awesome document factory
1 project | reddit.com/r/programming | 27 Mar 20211 project | reddit.com/r/coolgithubprojects | 27 Mar 20211 project | reddit.com/r/opensource | 27 Mar 2021
WeasyPrint – Convert web documents to PDF
1 project | news.ycombinator.com | 27 Mar 2021
wkhtmltopdf - Convert HTML to PDF
3 projects | reddit.com/r/commandline | 26 Mar 2021
Another free CLI tool to consider is WeasyPrint. (Github)
How to search a PDF file for text matching keywords
1 project | reddit.com/r/learnpython | 14 Jan 2022
Extracting Data from PDFs
1 project | reddit.com/r/learnpython | 7 Jan 2022
I have had good results with https://github.com/jsvine/pdfplumber , not sure if it works with graphs too.
Wnat to use camelot for pdf-extraction / error-message ghostscript?
1 project | reddit.com/r/learnpython | 22 Dec 2021
Alternatively, you can try pdfplumber which doesn't have that dep.
Trying to pull data from this PDF document, couldn’t figure out PyPDF.
1 project | reddit.com/r/learnpython | 6 Dec 2021
It may be easier if you can extract the "table" - pdfplumber is good for this.
I am a python newbie and want to know the complexity of this project idea before I begin.
1 project | reddit.com/r/learnpython | 7 Oct 2021
For getting tables and other structured data out of a pdf, consider using pdfplumber. It's an open source project on github, written in python. I've used it to automate the task of extracting tables and code samples from pdfs provided as homework prompts.
PDFPlumber: Pattern For Extracting Name in 'Last, First MI' Format
1 project | reddit.com/r/learnpython | 19 Sep 2021
From https://github.com/jsvine/pdfplumber regarding .extract_text()
Image Occlusion Cards from PDF, based on PDF Text Formatting
1 project | reddit.com/r/Anki | 1 May 2021
It appears https://github.com/jsvine/pdfplumber might have the API to "identify the text based on the formatting".
Scrape a PDF from the Web? Looking for guidance...
1 project | reddit.com/r/learnpython | 23 Mar 2021
How do I ceate a JSON file from information in PDF
1 project | reddit.com/r/learnpython | 9 Mar 2021
Need advice on pattern recognition image search within PDF
1 project | reddit.com/r/learnpython | 26 Feb 2021
What are some alternatives?
PyPDF2 - A utility to read and write PDFs with Python
PDFMiner - Python PDF Parser (Not actively maintained). Check out pdfminer.six.
pdfminer.six - Community maintained fork of pdfminer - we fathom PDF
pdftabextract - A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
pymorphy2 - Morphological analyzer / inflection engine for Russian and Ukrainian languages.
MathJax - Beautiful and accessible math in all browsers
borb - borb is a library for reading, creating and manipulating PDF files in python.
WKHTMLToPDF - Convert HTML to PDF using Webkit (QtWebKit)
Camelot - A Python library to extract tabular data from PDFs