paper-bidsheets
llama-ocr
paper-bidsheets | llama-ocr | |
---|---|---|
1 | 3 | |
7 | 2,226 | |
- | 7.0% | |
5.0 | 5.8 | |
4 months ago | 2 months ago | |
Go | TypeScript | |
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
paper-bidsheets
-
Llama-OCR: An Open-Source Llama 3.2 Based OCR Tool
I have recently used llama3.2-vision to handle some paper bidsheets for a charity auction and it is fairly accurate with some terrible handwriting. I hope to use it for my event next year.
I do find it rather annoying not being able to get it to consistently output a CSV though. ChatGPT and Gemini seem better at doing that but I haven’t tried to automate it.
The scale of my problem is about 100 pages of bidsheets and so some manual cleaning is ok. It is certainly better than burning volunteers time.
https://github.com/philips/paper-bidsheets
llama-ocr
-
Ollama-OCR for High-Precision OCR with Ollama
ollama-ocr is using a local vision model, if you want to use the online Llama 3.2-Vision model, try the llama-ocr library.
-
Supercharge Your AI Skills: 5 Open Source Repositories You Can't Afford to Miss
4. Llama OCR
-
Llama-OCR: An Open-Source Llama 3.2 Based OCR Tool
Here's the prompt being used, tweaking that might help: https://github.com/Nutlope/llama-ocr/blob/main/src/index.ts#...
What are some alternatives?
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
ollama-ocr - Implementing OCR with a local visual model run by ollama.
wordninja - Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
Thinking-Claude - Let your Claude able to think
nougat - Implementation of Nougat Neural Optical Understanding for Academic Documents
zerox - OCR & Document Extraction using vision models