openvino
mmocr
Metric | openvino | mmocr
---|---|---
Mentions | 17 | 6
Stars | 5,962 | 4,077
Growth | 3.0% | 1.6%
Activity | 10.0 | 4.7
Last commit | about 10 hours ago | 10 days ago
Language | C++ | Python
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
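The Growth figure can be read as an ordinary percentage change between two monthly star counts. The sketch below assumes that reading; the page does not give its exact formula, and the function name and the star counts are hypothetical, chosen only for illustration.

```python
def month_over_month_growth(stars_now: int, stars_month_ago: int) -> float:
    """Percentage change in stars relative to the count one month earlier.

    Assumption: plain percentage change. The tracking site may compute
    its Growth column differently.
    """
    if stars_month_ago <= 0:
        raise ValueError("stars_month_ago must be positive")
    return (stars_now - stars_month_ago) * 100.0 / stars_month_ago


# Hypothetical star counts, not taken from either project.
growth = month_over_month_growth(stars_now=1030, stars_month_ago=1000)
print(f"{growth:.1f}%")
```

Under this reading, a project that went from 1,000 to 1,030 stars in a month would show 3.0% growth.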
openvino
- FLaNK Stack 05 Feb 2024
- QUIK is a method for quantizing LLM post-training weights to 4 bit precision
- Intel OpenVINO 2023.1.0 released
- Intel OpenVINO 2023.1.0 released, open-source toolkit for optimizing and deploying AI inference
- OpenVINO 2023.1.0 released
- [N] Intel OpenVINO 2023.1.0 released, open-source toolkit for optimizing and deploying AI inference
- Powering Anomaly Detection for Industry 4.0
Anomalib is an open-source deep learning library developed by Intel that makes it easy to benchmark different anomaly detection algorithms on both public and custom datasets, all by simply modifying a config file. As the largest public collection of anomaly detection algorithms and datasets, it has a strong focus on image-based anomaly detection. It’s a comprehensive, end-to-end solution that includes cutting-edge algorithms, relevant evaluation methods, prediction visualizations, hyperparameter optimization, and inference deployment code with Intel’s OpenVINO Toolkit.
mmocr
- Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM
Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
- MMDeploy: Deploy All the Algorithms of OpenMMLab
MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
- [P] Modern open-source OCR capabilities and which model to choose
Link: https://github.com/open-mmlab/mmocr
- Text Classification Library for a Quick Baseline
For more text classification baselines (CRNN, NRTR, RobustScanner, SAR, SegOCR), check out https://github.com/open-mmlab/mmocr. They are reproducible and customizable.
- [N] MMOCR: A Toolbox for Text Detection, Recognition, and Understanding Based on PyTorch
We just released https://github.com/open-mmlab/mmocr, a new member in OpenMMLab https://openmmlab.com/. This first release supports
- OCR Baselines Based on PyTorch
What are some alternatives?
TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
deepsparse - Sparsity-aware deep learning inference runtime for CPUs
CRAFT-pytorch - Official implementation of Character Region Awareness for Text Detection (CRAFT)
mediapipe - Cross-platform, customizable ML solutions for live and streaming media.
doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
stable-diffusion - Go to lstein/stable-diffusion for all the best stuff and a stable release. This repository is my testing ground and it's very likely that I've done something that will break it.
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
neural-compressor - SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
iam-crnn-ctc-recognition - IAM Dataset Handwriting Recognition Using CRNN, CTC Loss, DeepSpeech Beam Search, And KenLM Scorer
nebuly - The user analytics platform for LLMs
keras-ocr - A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.