mmpose
mmocr
| | mmpose | mmocr |
|---|---|---|
| Mentions | 31 | 6 |
| Stars | 5,002 | 4,077 |
| Growth | 4.8% | 3.1% |
| Activity | 8.0 | 4.7 |
| Last commit | 4 days ago | 6 days ago |
| Language | Python | Python |
| License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
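As a rough illustration of how these figures can be read, the sketch below computes month-over-month growth from two star counts, inverts it to estimate the previous month's count, and converts an activity score into a "top X%" share. The function names are hypothetical, and the linear mapping from activity score to percentile is an assumption extrapolated from the single 9.0 example above; the site's actual formulas are not given here.

```python
def month_over_month_growth(stars_now: float, stars_prev: float) -> float:
    """Percentage change in stars relative to the previous month's count."""
    if stars_prev <= 0:
        raise ValueError("previous star count must be positive")
    return (stars_now - stars_prev) / stars_prev * 100.0


def implied_previous_stars(stars_now: float, growth_pct: float) -> float:
    """Invert the growth formula to estimate last month's star count."""
    return stars_now / (1.0 + growth_pct / 100.0)


def top_share_from_activity(activity: float) -> float:
    """Read an activity score on a 0-10 scale as 'top X%' of tracked projects.

    Assumption: the score behaves like a linear percentile rank, so that
    9.0 corresponds to the top 10%. Only the 9.0 case is stated in the text.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to lie between 0 and 10")
    return (10.0 - activity) * 10.0


if __name__ == "__main__":
    # Figures taken from the comparison table: (stars, growth %, activity).
    projects = {"mmpose": (5002, 4.8, 8.0), "mmocr": (4077, 3.1, 4.7)}
    for name, (stars, growth, activity) in projects.items():
        prev = implied_previous_stars(stars, growth)
        share = top_share_from_activity(activity)
        print(f"{name}: about {prev:.0f} stars a month ago, "
              f"roughly top {share:.0f}% by activity (under the linear assumption)")
```

Under that assumption, a higher activity score always means a smaller "top X%" share, which matches the direction of the 9.0 example but may not match the site's exact scaling elsewhere on the range.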
mmpose
-
RTMPose: The All-In-One Real-time Pose Estimation Solution for R&D
RTMPose-m achieves 75.8% AP on COCO with 90+ FPS on an Intel i7-11700 CPU and 430+ FPS on an NVIDIA GTX 1660 Ti GPU, and RTMPose-l achieves 67.0% AP on COCO-WholeBody with 130+ FPS.
-
MMDeploy: Deploy All the Algorithms of OpenMMLab
MMPose: OpenMMLab pose estimation toolbox and benchmark.
-
Model conversion from Pytorch to Tf using Onnx.
I downloaded pytorch2onnx.py from the mmPose tools. It's big, but the top half is imports and input arguments. On line 125, I hard-coded my (image) input size. I ran it on my .pth model file, and out popped an ONNX file.
-
Finetuning Openpose for custom dataset
They have a specific repo called mmpose: https://github.com/open-mmlab/mmpose
-
State of the art 2D body pose estimation [Discussion]
I would start with mmpose. It's basically a curated list of the best models ready to go.
-
[P] Object detection framework : Detectron2 VS MMDetection
The [MMLab key point detection](https://github.com/open-mmlab/mmpose) is in a separate repo from detection.
-
[D] Searching for open source pose estimation solution similar to open pose ?
One option is mmPose. They have a bunch of 2D/3D models implemented and support different skeleton structures.
-
Human Pose Estimation Recommendation
This library is pretty good. It has implementations for a number of pose estimators. I think Darkpose is the best one from memory
-
Human pose classification problem.
Check out https://github.com/open-mmlab/mmpose. I think they have guides for new datasets.
mmocr
-
Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM
Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
-
MMDeploy: Deploy All the Algorithms of OpenMMLab
MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
-
[P] Modern open-source OCR capabilities and which model to choose
Link: https://github.com/open-mmlab/mmocr
-
Text Classification Library for a Quick Baseline
For more text classification baselines (CRNN, NRTR, RobustScanner, SAR, SegOCR), check out https://github.com/open-mmlab/mmocr. They are reproducible and customizable.
-
[N] MMOCR: A Toolbox for Text Detection, Recognition, and Understanding Based on PyTorch
We just released https://github.com/open-mmlab/mmocr, a new member in OpenMMLab https://openmmlab.com/. This first release supports
- OCR Baselines Based on PyTorch
What are some alternatives?
openpose - OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
openpifpaf - Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.
CRAFT-pytorch - Official implementation of Character Region Awareness for Text Detection (CRAFT)
AlphaPose - Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
mmaction2 - OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
deep-high-resolution-net.pytorch - The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
iam-crnn-ctc-recognition - IAM Dataset Handwriting Recognition Using CRNN, CTC Loss, DeepSpeech Beam Search, And KenLM Scorer
AdelaiDet - AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
keras-ocr - A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.