BCNet
layout-parser
Metric | BCNet | layout-parser
---|---|---
Mentions | 1 | 6
Stars | 507 | 4,426
Growth | - | 3.0%
Activity | 3.0 | 0.0
Last commit | 10 months ago | about 1 month ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
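The exact formula behind the activity number is not given here. As a rough illustration of the idea only, the sketch below assumes an exponential decay with a 30-day half-life for commit weights and a simple percentile rank scaled to 0-10; the function names, the half-life, and the scaling are all hypothetical choices, not the site's actual method.

```python
def weighted_activity(commit_ages_days, half_life_days=30.0):
    """Raw activity: each commit counts less the older it is.

    Assumption: a commit's weight halves every `half_life_days` days.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_score(raw, all_raw):
    """Relative activity on a 0-10 scale.

    Assumption: the score is ten times the fraction of tracked
    projects whose raw activity is strictly lower than this one's.
    """
    below = sum(1 for other in all_raw if other < raw)
    return round(10.0 * below / len(all_raw), 1)


# Hypothetical commit histories, given as commit ages in days.
histories = {
    "recently_active": [1, 3, 7, 14],
    "long_dormant": [300, 310, 320, 330],
}
raw_values = {name: weighted_activity(ages) for name, ages in histories.items()}
scores = {
    name: activity_score(raw, list(raw_values.values()))
    for name, raw in raw_values.items()
}
```

Under these assumptions, a project whose raw value exceeds that of 90% of tracked projects scores 9.0, which matches the "top 10%" reading above.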
BCNet
- [CVPR2021] Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers
github: https://github.com/lkeab/BCNet
layout-parser
- Crates for converting PDFs into Markdown
I built my own solution using a combination of Tesseract and OpenCV (in Python). But even though the source PDF content is computer generated, I still get sporadic OCR errors. After writing my solution, I came across https://github.com/Layout-Parser/layout-parser, which might be a better starting point for dealing with PDFs, but I haven't tried it yet.
- OCR help required
This sounds more like a layout parsing issue. Look at Layout Parser; it has helped me on many occasions when I was battling to extract info from PDF documents.
- Amateur programmer here. Will Rust be used in backend for software in the future?
- Extract text from PDF
One of the tools I'm excited about (but haven't used in production) is LayoutParser. It's open source and can do some document image analysis, especially on non-generic docs.
- Document Classification
One project that I saw not too long ago which might be useful is this: https://github.com/Layout-Parser/layout-parser
- A Python Library for Document Layout Understanding
What are some alternatives?
mmdetection - OpenMMLab Detection Toolbox and Benchmark
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
yolov7 - Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
py-pdf-parser - A Python tool to help extract information from structured PDFs.
yolov7_d2 - 🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
medicaldetectiontoolkit - The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.
ssd_keras - A Keras port of Single Shot MultiBox Detector
PaddleDetection - Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
simpletransformers - Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
AdelaiDet - AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
shabby-pages - ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.