InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Tabula Alternatives
Similar projects and alternatives to tabula
-
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
markdown-cv
a simple template to write your CV in a readable markdown file and use CSS to publish/print it.
-
obsidian-notion-like-tables
Discontinued Your premiere tool for creating and managing tabular data in Obsidian.md
-
-
-
-
Traveling Ruby
Discontinued Self-contained Ruby binaries that can run on any Linux distribution and any macOS machine. [Moved to: https://github.com/FooBarWidget/traveling-ruby]
-
-
-
ITextSharp
[DEPRECATED] .NET port of the iText library, only security fixes will be added — please use iText for .NET
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
tabula discussion
tabula reviews and mentions
-
Dive into Time-Series Anomaly Detection: A Decade Review
Maybe you can use tabula [0] to extract the information from the PDF?
https://github.com/tabulapdf/tabula
- Automatisches Auslesen von PDFs
- How To: Extract Table From Image In Python (OpenCV & OCR)
-
Ruby
Another option would be JRuby. I routinely use an application called Tabula, which is built using JRuby and compiles to a Jar file. This, of course, requires Java on the target machine, but you can ship the Jar file and it will work. It's often easier to rely on a working Java environment than it is a working Ruby environment. Especially on Windows.
- I am looking to automate a process at work...
-
Self Hosted Roundup #19
Idk if it has been suggested yet, tabulapdf is a self hosted solution to extract tables from PDF
- Alternative to tabula.technology
-
Text extraction from pdf, word and PPT
For table extraction from pdfs, have a look at Tabula and Camelot, two open-source projects. They work well with clean tables, both the Tabula Python binding and Camelot allow you to export directly as a pandas dataframe. Otherwise AWS Textract API is very efficient at extracting tables from pdfs, regardless of how clean/messy they are.
-
hello everyone someone can help me to resolve this problem please. i want to extract this unstructured data from pdf file to excel file
No idea if it will work for you, but there is a git project that seems to do what you want https://github.com/tabulapdf/tabula
- Why is the point of having so many implementation of Ruby?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 23 Jun 2025
Stats
tabulapdf/tabula is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of tabula is CSS.