Reducing the Size of Large PDFs

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • matplotlib

    matplotlib: plotting with Python

  • Oh, I didn't know about the improved Type 42 font support in the new matplotlib! That's good to know and I should check it out.

    And good point, the PGF backend works just as well (results should be identical). But since all the plot information has to be compiled by LaTeX, it ends up ballooning the compilation time of the TeX document, and the matplotlib PGF page suggests that you can run into memory issues as well. I was doing this for a thesis with 50+ plots, so I still wanted compilation to be fast.

    I've suggested this as an improvement to matplotlib, but it's unlikely to be merged since it's maybe a bit hacky (although it's very similar to what Inkscape's export to LaTeX option does): https://github.com/matplotlib/matplotlib/issues/22297 (the backend file can be found here: https://github.com/matplotlib/matplotlib/files/7921801/backe...)

    And the gs script is below:

      #!/bin/bash
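
The comment's script is not reproduced here beyond its first line. For orientation only, a common way to shrink a PDF with Ghostscript is a single pass through its `pdfwrite` device, which might be driven from Python roughly as sketched below. This is not the commenter's script: the function names `build_gs_command` and `shrink_pdf`, the file names, and the choice of the `/ebook` preset are all assumptions made for illustration, and the preset downsamples embedded images, which may or may not be what you want for plots.

```python
import subprocess
from pathlib import Path


def build_gs_command(src, dst, preset="/ebook"):
    """Return the argument list for one Ghostscript pdfwrite pass.

    The preset is an assumption; Ghostscript also accepts /screen,
    /printer, /prepress and /default for -dPDFSETTINGS.
    """
    return [
        "gs",
        "-sDEVICE=pdfwrite",          # rewrite the input as a new PDF
        "-dCompatibilityLevel=1.4",
        f"-dPDFSETTINGS={preset}",    # quality/size trade-off preset
        "-dNOPAUSE",
        "-dQUIET",
        "-dBATCH",                    # exit when the input is processed
        f"-sOutputFile={dst}",
        str(src),
    ]


def shrink_pdf(src, dst, preset="/ebook"):
    """Run Ghostscript (must be on PATH) and return (old_size, new_size) in bytes."""
    subprocess.run(build_gs_command(src, dst, preset), check=True)
    return Path(src).stat().st_size, Path(dst).stat().st_size
```

Calling `shrink_pdf("thesis.pdf", "thesis-small.pdf")` (hypothetical file names) would write the smaller file and report both sizes, so you can check whether the pass actually helped.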

  • pdfsizeopt

    PDF file size optimizer

  • There is a general PDF shrinker known as "pdfsizeopt", which is bundled with static builds of gs and a number of other utilities.

    It shrinks some of our PDFs by a factor of 10, mostly by removing unused fonts (but doubtless also through some other magic).

    The developer asks for donations for production use from those who can afford it.

    https://github.com/pts/pdfsizeopt

    Send donations to the author of pdfsizeopt:

    https://flattr.com/submit/auto?user_id=pts&url=https://githu...
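
To check a claim like the tenfold reduction above on your own files, a minimal sketch along these lines could run pdfsizeopt and report the ratio. It assumes the `pdfsizeopt` executable is installed and on `PATH` and uses its basic `pdfsizeopt input.pdf output.pdf` form; the helper names `build_pdfsizeopt_command`, `size_ratio`, and `optimize` are hypothetical, not part of the tool.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdfsizeopt_command(src, dst, executable="pdfsizeopt"):
    """Return the argument list for the basic input/output invocation."""
    return [executable, str(src), str(dst)]


def size_ratio(before_bytes, after_bytes):
    """How many times smaller the output is than the input."""
    return before_bytes / after_bytes


def optimize(src, dst):
    """Run pdfsizeopt on src, write dst, and return the size ratio."""
    if shutil.which("pdfsizeopt") is None:
        raise FileNotFoundError("pdfsizeopt not found on PATH")
    subprocess.run(build_pdfsizeopt_command(src, dst), check=True)
    return size_ratio(Path(src).stat().st_size, Path(dst).stat().st_size)
```

A return value near 10 from `optimize("report.pdf", "report.opt.pdf")` (hypothetical file names) would correspond to the reduction described in the comment; results will vary with how the PDF was produced.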

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • How and where is matplotlib package making use of PySide?

    2 projects | /r/learnpython | 7 Dec 2023
  • Tkinter, PyGame windows too large on Mac

    1 project | /r/learnpython | 29 Jun 2023
  • [OC] Attempted & Completed Suicide Rate in Canada, 1998/99

    1 project | /r/dataisbeautiful | 5 Mar 2023
  • Help unpickling an old dataset

    1 project | /r/learnpython | 8 Feb 2023
  • pcolormesh very slow when using "log" axes

    1 project | /r/learnpython | 7 Feb 2023