llama2_aided_tesseract
charles-dickens_a-christma
llama2_aided_tesseract | charles-dickens_a-christma | |
---|---|---|
4 | 2 | |
204 | - | |
- | - | |
7.2 | - | |
10 months ago | - | |
Python | ||
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llama2_aided_tesseract
-
Standard Ebooks
I made a tool like that, and I bet with a more powerful LLM like GPT4, and perhaps a better baseline OCR tool (like GPT4 vision), it could work really well for this sort of thing:
https://github.com/Dicklesworthstone/llama2_aided_tesseract
- Use Llama2 to Improve the Accuracy of Tesseract OCR
- FLaNK Stack Weekly for 07August2023
- Show HN: Using LLama2 to Correct OCR Errors
charles-dickens_a-christma
-
Standard Ebooks
Sometimes capitalisation matters are close to purely stylistic, but other times they really are part of the content, guiding pronunciation or emphasis, so that lowercasing them harms the work. What is your opinion of my assessment in the above comment of some of the specific changes in <https://github.com/standardebooks/charles-dickens_a-christma...>?
What are some alternatives?
harlequin - The SQL IDE for Your Terminal.
tools - The Standard Ebooks toolset for producing our ebook files.
gorilla-cli - LLMs for your CLI
OpenBuddy - Open Multilingual Chatbot for Everyone
CallCMLModel - An example on calling models deployed in CML
EverythingApacheNiFi - EverythingApacheNiFi
fuzzy-matcher - A Java library to determine probability of objects being similar.
anomalib - An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
ToolBench - [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
fiftyone - The open-source tool for building high-quality datasets and computer vision models
opensms - Open-source solution to programmatically send and receive SMS using your own SIM cards
Transformers-Tutorials - This repository contains demos I made with the Transformers library by HuggingFace.