Mayan EDMS
EasyOCR
Our great sponsors
Mayan EDMS | EasyOCR | |
---|---|---|
34 | 38 | |
549 | 21,795 | |
8.0% | 2.7% | |
0.0 | 4.6 | |
2 months ago | 24 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Mayan EDMS
-
A Clutter-Free Life: Going Paperless with Paperless-Ngx
I use MayanEDMS personally, and have for the past five or so years. It's complex but does what it says on the tin.
- Sistema de gestión documental
-
Document Management with REST API and User Permissions
Mayan EDMS has an api and rbac. https://www.mayan-edms.com/
-
Can anybody recommend a document management system?
I heard good things about https://www.mayan-edms.com/ Never used it myself, though.
-
Question from our chef
You can scan all the invoices into TIF / PDF format and then use a (free) program with Optical Character Recognition (OCR) like this (https://www.mayan-edms.com/) to index them. This will allow you to search for the key words with a filter on the date of scan.
- Software for sending files/media to clients for revisions/approvals?
-
Hermes, an Open Source Document Management System
There's also Mayan EDMS [1]. I have no experience with it, but looks sensible from the outside.
- PDF / DOC Library?
-
Anything self hosted like paperless.io?
Mayan EDMS
-
Electronic material/document management system wanted (PDFs, videos, audio files, review process, feedback, user management, etc.)
I think Mayan EDMS meets all criteria
EasyOCR
-
Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
- OCR a lot of hand written invoice and records?
-
[P] EasyOCR in C++!
I just uploaded my C++ implementation of EasyOCR, a well known ocr library for python. Also dusted some cobwebbs from some audio related projects as well, feel free to leave feedback or contribute! I only implemented the most salient parts, so certainly could use some community help! Cheers!
-
OCR at Edge on Cloudflare Constellation
EasyOCR is a popular project if you are in an environment where you can use run Python and PyTorch (https://github.com/JaidedAI/EasyOCR). Other open source projects of note are PaddleOCR (https://github.com/PaddlePaddle/PaddleOCR) and docTR (https://github.com/mindee/doctr).
-
Donut: OCR-Free Document Understanding Transformer
The main one was https://github.com/JaidedAI/EasyOCR, mostly because, as promised, it was pretty easy to use, and uses pytorch (which I preferred in case I wanted to tweak it). It has been updated since, but at the time it was using CRNN, which is a solid model, especially for the time - it wasn't (academic) SOTA but not far behind that. I'm sure I could've coaxed better performance than I got out of it with some retraining and hyperparameter tuning.
-
Help with OCR of pixel-y numbers
Anyways, you can give a shot to EasyOCR, pretty solid and flexible
- How to perform document OCR?
-
Python unexpectedly quits (macOS ventura, M1)
The easyocr library: https://github.com/JaidedAI/EasyOCR
- I made a website for a friend who owns a restaurant. He's wondering if there's a way to upload a picture of his menu daily. What is the best way to do this?
-
Raspberry Pi Easyocr
Not used it on a Pi but maybe a Docker version (if there is one) would run? Compose file here
What are some alternatives?
Paperless-ng - A supercharged version of paperless: scan, index and archive all your physical documents
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
paperless-ngx - A community-supported supercharged version of paperless: scan, index and archive all your physical documents
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
Papermerge - Open Source Document Management System for Digital Archives (Scanned Documents)
doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Paperless - Scan, index, and archive all of your paper documents
OpenCV - Open Source Computer Vision Library
Teedy - Lightweight document management system packed with all the features you can expect from big expensive solutions
awesome-colab-notebooks - Collection of google colaboratory notebooks for fast and easy experiments
Docspell - Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources with miminal effort.
tesserocr - A Python wrapper for the tesseract-ocr API