Top 3 document-parser Open-Source Projects
-
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
-
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Be careful with unstructured:
https://github.com/Unstructured-IO/unstructured/blob/d11c70c...
from: https://github.com/open-webui/open-webui/issues/687
Project mention: DeepSeek-V2 integrated, RAGFlow v0.5.0 is released | news.ycombinator.com | 2024-05-07
Project mention: Show HN: Beyond text splitting – improved file parsing for LLM's | news.ycombinator.com | 2024-04-07
https://github.com/deepdoctection/deepdoctection
Have you tried this?
document-parser related posts
-
Show HN: Beyond text splitting – improved file parsing for LLM's
-
DeepDoctection
-
DeepDoctection: Document extraction and analysis using deep learning models
-
A note from our sponsor - SaaSHub
www.saashub.com | 11 May 2024
Index
What are some of the best open-source document-parser projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | unstructured | 6,682 |
2 | ragflow | 6,507 |
3 | deepdoctection | 2,209 |