Traveling Ruby
tabula
Traveling Ruby | tabula | |
---|---|---|
6 | 11 | |
2,005 | 6,534 | |
- | 0.8% | |
5.8 | 2.8 | |
over 2 years ago | about 1 month ago | |
Shell | CSS | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Traveling Ruby
-
Ruby
If you absolutely need a native binary distribution for your apps, there is a project called Traveling Ruby that originated at Phusion, makers of the popular Phusion Passenger Ruby application server. It's worth noting that this project has a number of open issues that are aging and the latest commits are from 2021, so I'm not sure about its current status. There are also important caveats with regard to native extensions and Windows. Given the popularity of packages that require native extensions (like the XML/HTML library Nokogiri), you may find that this solution simply doesn't work for you.
- Is there a way to package up a Ruby script as a desktop executable app?
-
Having issues installing Ruby
You may be to get a precompiled binary with OpenSSL 1.1 statically linked. Maybe Traveling Ruby? https://github.com/phusion/traveling-ruby
-
Alternatives for Ocra ???
There's really not much else in this space. The main alternative - Traveling Ruby - has limitations on Windows and I don't think it supports Ruby 3.0.
-
Vagrant is being rewritten in Go.
But even with all of the above, you're absolutely right, it is just easier to ship a binary blob. That's where the rewrite totally pays off. I just wonder whether the team has stressed all the options when it comes to keep ruby. There are packaging solutions which ship with its own interpreter, such as Travelling Ruby. And mruby could also generate a binary blob, although they'd have to open another can of works, such as finding replacements for dependencies such as net-ssh, which AFAIK can't be used with mruby. So in the end, maybe they did. And given the prevalence of go products in hashicorp, maybe it makes sense to just invest a bit more in it?
-
My Ruby game is getting false positives in virus scanners. Help?
You could try using Traveling Ruby as an alternative to Ocra. I have only used Ocra in the past for this task, but I'd say it's worth a try.
tabula
- Automatisches Auslesen von PDFs
- How To: Extract Table From Image In Python (OpenCV & OCR)
-
Ruby
Another option would be JRuby. I routinely use an application called Tabula, which is built using JRuby and compiles to a Jar file. This, of course, requires Java on the target machine, but you can ship the Jar file and it will work. It's often easier to rely on a working Java environment than it is a working Ruby environment. Especially on Windows.
- I am looking to automate a process at work...
-
Self Hosted Roundup #19
Idk if it has been suggested yet, tabulapdf is a self hosted solution to extract tables from PDF
- Alternative to tabula.technology
-
Text extraction from pdf, word and PPT
For table extraction from pdfs, have a look at Tabula and Camelot, two open-source projects. They work well with clean tables, both the Tabula Python binding and Camelot allow you to export directly as a pandas dataframe. Otherwise AWS Textract API is very efficient at extracting tables from pdfs, regardless of how clean/messy they are.
-
hello everyone someone can help me to resolve this problem please. i want to extract this unstructured data from pdf file to excel file
No idea if it will work for you, but there is a git project that seems to do what you want https://github.com/tabulapdf/tabula
- Why is the point of having so many implementation of Ruby?
-
Pdfsandwich
While trying to find a specific project I recalled, I encountered this list of projects which might be of interest: https://github.com/tstanislawek/awesome-document-understandi...
The project I had in mind was similar to this one but I can't remember the name currently: https://github.com/tabulapdf/tabula
However, if you're looking for a ML-based, invoice-specific project looks like the other comment to your reply might be more useful.
What are some alternatives?
Codacy
Apache PDFBox - Mirror of Apache PDFBox
OctoLinker - OctoLinker — Links together, what belongs together
obsidian-notion-like-tables - Your premiere tool for creating and managing tabular data in Obsidian.md
Hakiri - Secure Ruby apps with Hakiri
awesome-english-ebooks - 经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
Gitlab CI - GitLab CE Mirror | Please open new issues in our issue tracker on GitLab.com
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
PR Dashboard
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
HuBoard - Kanban board for github issues
laravel-report-generator - Rapidly Generate Simple Pdf, CSV, & Excel Report Package on Laravel