layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis (by Layout-Parser)
simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI (by ThilinaRajapakse)
layout-parser | simpletransformers | |
---|---|---|
7 | 6 | |
5,283 | 4,189 | |
1.6% | 0.4% | |
0.0 | 2.7 | |
10 months ago | about 2 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
layout-parser
Posts with mentions or reviews of layout-parser.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-07-30.
- Ask HN: What are you using to parse PDFs for RAG?
-
Crates for converting PDF's into Markdown
I built my own solution using a combination of Tesseract and OpenCV (in python). But even though the source PDF content is computer generated, I still get sporadic OCR errors. After writing my solution, I came across this https://github.com/Layout-Parser/layout-parser which might be a better starting point for dealing with PDFs but I haven't tried it yet.
-
OCR help required
This sound more like a layout parking issue. Look at Layout Parser, it has helped me on many occasions when I was battling to extract info from PDF documents.
- Amateur programmer here. Will Rust be used in backend for software in the future?
-
Extract text from PDF
One of the tools I'm excited about (but haven't used in production) is LayoutParser. It's open-source, and can do some document image analysis especially on non-generic docs.
-
Document Classification
One project that I saw not to long ago which might be useful is this: https://github.com/Layout-Parser/layout-parser
- A Python Library for Document Layout Understanding
simpletransformers
Posts with mentions or reviews of simpletransformers.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-12-11.
-
Huggingface is a great idea poorly executed.
You might try this: https://github.com/ThilinaRajapakse/simpletransformers
-
Gpt 2 124m using transformers
https://github.com/ThilinaRajapakse/simpletransformers/blob/master/simpletransformers/language_generation/language_generation_model.py#L146
-
Neural Search Tutorial
Getting embeddings from BERT Encoder
-
Neural Search Step-by-Step
Tutorial includes: - What is the Neural Search? - Getting embeddings from BERT Encoder - Using vector search engine Qdrant - Creating an API server with FastAPI.
-
Document Classification
If you want to do text classification hugging face transformers is great. There's also a simple version for it: https://github.com/ThilinaRajapakse/simpletransformers
-
A Shortly like user interface for GPT 2?
Here is an example script to finetune a GPT-2 model: https://github.com/ThilinaRajapakse/simpletransformers/blob/master/examples/language_generation/fine_tune.py
What are some alternatives?
When comparing layout-parser and simpletransformers you can also consider the following projects:
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
py-pdf-parser - A Python tool to help extracting information from structured PDFs.
BERTweet - BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
BCNet - Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]
kiri - Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.