invoice2data

Extract structured data from PDF invoices (by invoice-x)

Invoice2data Alternatives

Similar projects and alternatives to invoice2data

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  • DeepSpeech

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • silero-models

    32 invoice2data VS silero-models

    Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

  • gensim

    Topic Modelling for Humans

  • orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

  • traprange

    (Java)A Method to Extract Tabular Content from PDF Files

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • pyod

    A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better invoice2data alternative or higher similarity.

invoice2data reviews and mentions

Posts with mentions or reviews of invoice2data. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-28.
  • Utilize OpenAI API to extract information from PDF files
    2 projects | dev.to | 28 Jan 2023
    Using regex: to match patterns in text after converting the PDF to plain text. Examples include invoice2data and traprange-invoice. However, this method requires knowledge of the format of the data fields.
  • Base64.ai – Extract text, data, photos and more from all types of docs
    4 projects | news.ycombinator.com | 10 Feb 2021
    It's not really working. Tried 2 English PDF invoices. Normal format. One came back empty, the other only had the amount right.

    I'm assuming they only trained on some specific documents (passport of country X, etc) and all others don't work.

    If someone processes the same document all the time, then my invoice2data project may work better and is open source. It's based on Regx, rather than machine learning: https://github.com/invoice-x/invoice2data

Stats

Basic invoice2data repo stats
2
1,685
6.6
about 1 month ago

invoice-x/invoice2data is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of invoice2data is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com