Top 4 HTML OCR Projects
-
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
-
documentation
Documentation for Papermerge DMS - Installation, Help, User Manual, REST API (by papermerge)
-
Warframe-OCR
A relic inventory recognition system for Warframe, based on experimental Rust bindings to Tesseract OCR. Supports detection in real-time. Very much WIP.
Be careful with unstructured:
https://github.com/Unstructured-IO/unstructured/blob/d11c70c...
from: https://github.com/open-webui/open-webui/issues/687
Project mention: Show HN: Kimchi Reader – Immersive Korean Learning with a Popup Dictionary | news.ycombinator.com | 2023-10-29
HTML OCR related posts
- Show HN: Kimchi Reader – Immersive Korean Learning with a Popup Dictionary
- Unstructured – OSS libraries and APIs to build custom preprocessing pipelines
- More intelligent Pdf parsers
- Help extracting data from multiple PDF's
- Any way to convert my handwritten diary to searchable PDFs?
- Pre-processing text documents such as PDFs, HTML and Word Documents for LLMs
- Sites for anime or series sub japanese? or other forms of immersion.
Index
What are some of the best open-source OCR projects in HTML? This list will help you:
# | Project | Stars |
---|---|---|
1 | unstructured | 6,415 |
2 | mokuro | 706 |
3 | documentation | 13 |
4 | Warframe-OCR | 1 |