pdfcpu
rich
Our great sponsors
pdfcpu | rich | |
---|---|---|
30 | 148 | |
6,206 | 46,981 | |
4.4% | 0.9% | |
9.0 | 8.3 | |
10 days ago | 7 days ago | |
Go | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pdfcpu
- Show HN: A PDF Processing CLI/API Written in Go
- Show HN
-
Making a PDF that's larger than Germany
Slightly tangential: if you are hacking on PDFs, manually or otherwise, this is an incredibly useful tool: https://pdfcpu.io/ (not the author, just a user)
-
Stirling-PDF: local web application to perform various operations on PDFs
A really nice, stand-alone command line tool is pdfcpu.
https://github.com/pdfcpu/pdfcpu
-
pdfcpu v0.6.0 out! - pdfcpu.io
Check it out => https://github.com/pdfcpu/pdfcpu/releases/tag/v0.6.0
-
Marker: Convert PDF to Markdown quickly with high accuracy
I can report that the closest I've came before is with PDFMiner (https://pypi.org/project/pdfminer/) for Python. The benefit of this one is that it retains styling information, so that italics and the like can be retained, at least with some post-processing (I think one might need to convert certain CSS-classes to actual or tags).
The other option I have started looking into is the PDFCPU library for Go. It is a bit more low-level than PDFMiner, but one gets out very well structured info, that seem it might be possible to post-process quite well, for one's particular use case and PDF layouts: https://github.com/pdfcpu/pdfcpu
I also now tried the Marker tool in the OT, and it seems to do a reasonable job. It did intermingle some columns though, at least in some tricky cases such as when there were a round shaped image in between the two columns. One note is that Marker doesn't seem to retain styling like italics though.
-
PDFcpu snippet for read text of PDF file?
Of course, the best way would be to solve it via the API without CLI. But this doesn't seem to work. https://github.com/pdfcpu/pdfcpu/issues/122
- wie splittet ihr denn PDFs - ich hab hier einige - die ich zerlegen muss in Teile
- Do you know any library to make pdf in golang?
- Pdfcpu: A Go PDF Processor
rich
- Rich is a Python library for rich text and beautiful formatting in the terminal
-
Neat Parallel Output in Python
There is an open issue [1] on GitHub to make it more modular and get rid of markdown and syntax highlighting but I have no hope for rich to get more minimal.
[1]: https://github.com/Textualize/rich/issues/2277
-
Ask HN: Programmers and Technologists in Scotland
I hope he doesn't mind, but the creator of Rich and Textualize is a good guy, and Scottish: https://www.willmcgugan.com/about/
https://www.textualize.io/
https://github.com/Textualize/rich
-
Python 3.12
They keep getting improved error messaging and this is one of my favorite features. But I'd love if we could get some real rich text. Idk if anyone else uses rich, but it has infected all my programs now. Not just to print with colors, but because it makes debugging so much easier. Not just print(f"{var=}") but the handler[0,1]. Color is so important to these types of things and so is formatting. Plus, the progress bars are nice and have almost completely replaced tqdm for me[2]. They're just easier and prettier.
[0] https://rich.readthedocs.io/en/stable/logging.html
[1] Try this example: https://github.com/Textualize/rich/blob/master/examples/exce...
[2] Side note: does anyone know how to get these properly working when using DDP with pytorch? I get flickering when using this and I think it is actually down to a pytorch issue and how they're handling their loggers and flushing the screen. I know pytorch doesn't want to depend on rich, but hey, pip uses rich so why shouldn't everyone?
-
colors.crumb - first Crumb usable. Extending Crumb with basic terminal styling and RGB, HEX, ANSI conversion functions.
colors.crumb extends Crumb with basic terminal styling functions and RGB, HEX, ANSI conversion functions. It is in the realm of JavaScript's chalk and Python's rich but slightly more functional 😉.
-
Textual: Rapid Application Development Framework for Python
I am working on a new python project and one of the first things I added was https://github.com/Textualize/rich because of how easy it is to make things look good in the terminal.
-
What are you rewriting in rust?
I am not rewriting anything but I'd love to have a library like `rich` in Rust: https://github.com/textualize/rich
-
Things to do with standalone script
Add some cool-looking stuff to your output with rich.
-
I made a library for making user terminal input really really pretty!
You might consider taking inspiration from the rich module. In particular, I like how rich supports inline color theming which seems much more cumbersome in your framework, requiring the use of context managers as well as familiarity with how your framework structures color objects. Other than that though, I'm impressed!
-
coBib 4.0: a modern UI using Textualize libraries
Today I released coBib 4.0, my console bibliography manager written in Python, which now uses rich and textual to provide a cohesive and modern user experience in both its CLI and TUI.
What are some alternatives?
gopdf - A simple library for generating PDF written in Go lang
tqdm - :zap: A Fast, Extensible Progress Bar for Python and CLI
go-wkhtmltopdf - Go bindings for wkhtmltopdf and high-level HTML to PDF conversion interface
colorama - Simple cross-platform colored terminal text in Python
qpdf - QPDF: A content-preserving PDF document transformer
python-prompt-toolkit - Library for building powerful interactive command line applications in Python
merge2pdf - Merge Image and PDF files (optionally with selective pages) with lossless quality
textual - The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
markpdf - Watermark PDF files using image or text
blessed - Blessed is an easy, practical library for making python terminal apps
ngrok - Unified ingress for developers
alive-progress - A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!