document-analysis

Open-source projects categorized as document-analysis
Language: + Python + C# + C++

Top 6 document-analysis Open-Source Projects

  • PdfPig

    Read and extract text and other content from PDFs in C# (port of PDFBox)

  • awesome-document-understanding

    A curated list of resources for Document Understanding (DU) topic

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • pandora

    Pandora is an analysis framework to discover if a file is suspicious and conveniently show the results (by pandora-analysis)

  • robin

    RObust document image BINarization (by masyagin1998)

  • local_adaptive_binarization

    Local adaptive image binarization

  • pydoxtools

    Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable pipelines and diverse sources for your projects.

  • Project mention: What is the most cost-efficient way to have an embedding generator endpoint that is using an open-source embedding model? [D] | /r/MachineLearning | 2023-06-01
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

document-analysis related posts

Index

What are some of the best open-source document-analysis projects? This list will help you:

Project Stars
1 PdfPig 1,462
2 awesome-document-understanding 1,115
3 pandora 234
4 robin 169
5 local_adaptive_binarization 124
6 pydoxtools 54

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com