pdfplumber vs mupdf

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

pdfplumber		mupdf
	Project
29	Mentions	28
5,603	Stars	55
-	Growth	-
8.2	Activity	8.8
16 days ago	Latest Commit	4 months ago
Python	Language	C
MIT License	License	GNU Affero General Public License v3.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

pdfplumber

Posts with mentions or reviews of pdfplumber. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-30.

Running OCR against PDFs and images directly in the browser
7 projects | news.ycombinator.com | 30 Mar 2024
Google Scholar PDF Reader
11 projects | news.ycombinator.com | 20 Mar 2024

- [pdfplumber](https://github.com/jsvine/pdfplumber)
Parsing dates with PDFminer
1 project | /r/learnpython | 4 Jul 2023
How to Extract Data from Tables in a Public Record PDF
2 projects | /r/Journalism | 26 Jun 2023

I recently published a story that was based on some data analysis I did of a report I obtained from the Department of Behavioral Health and Developmental Services in VA. I wanted to share a quick walkthrough of how I extracted the data from tables in a PDF using a Python module called PDFplumber. I also uploaded a video to Youtube if you prefer that.
Code to extract text from pdf to excel
2 projects | /r/Python | 2 Jun 2023

I've been working with pdfplumber, which is built atop pdfminer.six. It allows one to break the page up into sections and extract text from them in turn, which may help keep columns separated better.
I need to parse unstructured tables from a pdf into a json, what can I do
1 project | /r/computervision | 21 May 2023

You could try pdfplumber
Advanced PDF to Excel with documents and example code
2 projects | /r/learnpython | 1 May 2023

I'm not sure if there is a way to reliably detect bold characters: https://github.com/jsvine/pdfplumber/issues/724
how do I automate extracting data from two pdfs and input into an excel sheet according to an order number
2 projects | /r/learnpython | 24 Apr 2023

pdfplumber is also pretty good. It can help segment text a bit better than pdfminer can alone.
Extracting particular things from pdf program?
1 project | /r/learnpython | 21 Jan 2023

To handle machine generated one, a possible package is pdfplumber.
Convert PDF to text for parsing
1 project | /r/learnpython | 14 Jan 2023

mupdf

Posts with mentions or reviews of mupdf. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-15.

⟳ 4 apps added, 121 updated at f-droid.org
24 projects | /r/FDroidUpdates | 15 Sep 2023

MuPDF mini (version 1.23.3a): Minimalist viewer for PDF, XPS, CBZ, unprotected EPUB, and FB2 documents
Alexandria: A minimalistic cross-platform eBook reader
12 projects | news.ycombinator.com | 28 Aug 2023

mupdf is the mupdf of epub; it supports epub and other formats beyond pdf¹. When I've had really large files I've used mupdf to read them a few times, as it seems to be far better at handling them than other tools.
¹ https://mupdf.com/
PDFs - zerlegen unter Linux: in 40 Ordner alle PDFs in Onepager umwandeln - mit einem Schritt - Tools und Verfahren?
1 project | /r/de_EDV | 11 May 2023

Mupdf https://mupdf.com MuPDF is a lightweight bla…
Pdf reader for less memory consumption
2 projects | /r/Windows10 | 22 Mar 2023

Doesn't exactly fit your requirements, but might as well mention it here: muPDF - the most lightweight PDF viewer that I've ever seen. You don't even have to install it. Mobile version exists by the way.
⟳ 1 apps added, 74 updated at f-droid.org
15 projects | /r/FDroidUpdates | 21 Mar 2023

MuPDF viewer (version 1.21.0a): Lightweight document viewer
a good pdf reader
2 projects | /r/linux | 6 Mar 2023

Mupdf is nice https://mupdf.com/
Zathura can't manage big files
1 project | /r/archlinux | 16 Dec 2022

I don't see anything about that on ArchWiki or MuPDF's website. Could you provide details on how it is obsolete and what critical vulnerabilities it has?
Uma introdução de Active Storage em Rails 7
2 projects | dev.to | 14 Dec 2022

poppler ou muPDF para pré-visualizações de PDF
Show HN: I am building a new Python library to read/write PDF files
17 projects | news.ycombinator.com | 17 Nov 2022

I think you might mean PyMuPDF (https://github.com/pymupdf/PyMuPDF), a Python library built on top of the MuPDF C library (https://mupdf.com/).
PyMuPDF and MuPDF are both available under dual open source AGPL and commercial licenses. They have been around for many years and are under continual development.
[Disclaimer, i work for Artifex, who wrote MuPDF and recently acquired PyMuPDF.]
Private reading app for iOS
2 projects | /r/PrivacyGuides | 16 Nov 2022

Have you tried MuPDF?

What are some alternatives?

When comparing pdfplumber and mupdf you can also consider the following projects:

PDFMiner - Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Apache PDFBox - Mirror of Apache PDFBox

PyPDF2 - A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

okular - KDE document viewer

OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

NekoX - A third-party Telegram android app.

pdfminer.six - Community maintained fork of pdfminer - we fathom PDF

peertube-android - Thorium, a PeerTube Android Client

py-pdf-parser - A Python tool to help extracting information from structured PDFs.

ics-openvpn - OpenVPN for Android

PyMuPDF - PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

sioyek-website

pdfplumber vs PDFMiner mupdf vs Apache PDFBox pdfplumber vs PyPDF2 mupdf vs okular pdfplumber vs OCRmyPDF mupdf vs NekoX pdfplumber vs pdfminer.six mupdf vs peertube-android pdfplumber vs py-pdf-parser mupdf vs ics-openvpn pdfplumber vs PyMuPDF mupdf vs sioyek-website

Compare pdfplumber vs mupdf and see what are their differences.

pdfplumber

mupdf

pdfplumber

mupdf

What are some alternatives?