table-transformer
superduperdb
Metric | table-transformer | superduperdb
---|---|---
Mentions | 9 | 24
Stars | 1,869 | 4,415
Growth | 8.3% | 3.7%
Activity | 6.1 | 9.9
Last commit | 5 months ago | 4 days ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
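As a rough illustration of the Growth figure, the sketch below computes month-over-month growth as a plain percentage change in star count. The exact formula the site uses is not stated here, so the percentage-change definition, the function names, and the sample numbers are all assumptions made for illustration.

```python
def month_over_month_growth(previous: int, current: int) -> float:
    """Percentage change in stars from last month to this month.

    Assumption: growth = (current - previous) / previous * 100.
    The tracking site may compute it differently.
    """
    if previous <= 0:
        raise ValueError("previous star count must be positive")
    return (current - previous) / previous * 100.0


def implied_previous_count(current: int, growth_percent: float) -> float:
    """Star count a month earlier implied by the current count and growth."""
    return current / (1.0 + growth_percent / 100.0)


if __name__ == "__main__":
    # Hypothetical counts: 1,000 stars last month, 1,083 stars now.
    print(f"{month_over_month_growth(1000, 1083):.1f}%")
    # Under the same assumption, 1,869 stars at 8.3% growth implies
    # roughly 1,726 stars a month earlier.
    print(round(implied_previous_count(1869, 8.3)))
```

Under this reading, a smaller project can show a higher growth percentage than a larger one while gaining fewer stars in absolute terms.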
table-transformer
- Data extraction from pdf
  Saw this last time but never played with it: https://github.com/microsoft/table-transformer
- FLaNK Stack Weekly 11 Dec 2023
- [P] OCR + Table Extraction Advice
  Have you tried the SOTA on Table Detection and Extraction with out of the box model weights?
- How do you parse tables in PDF with langchain? Especially, the context which is few lines above and below the table.
  https://huggingface.co/blog/document-ai https://github.com/microsoft/table-transformer https://github.com/google-research/pix2struct https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/ppstructure/table/README.md
- [D] Unimpressive improvement in training speed after upgrading from GTX 980 Ti to RTX 4090
  GPU is at 100% although a bit spiky (dropping occasionally to 50%), I expect that to be normal? I use the same configuration as the authors, num_workers is set to 1: https://github.com/microsoft/table-transformer/blob/main/src/structure_config.json. Data is on a separate SSD, C-drive is a NVMe SSD.
- Microsoft TableTransformer
- DeepDoctection
superduperdb
- FLaNK Stack Weekly 12 February 2024
- FLaNK Stack Weekly 11 Dec 2023
- Trending on GitHub top 10 globally for the 4th day in a row: Open-source framework for integrating OpenAI with major databases
- Trending on GitHub top 10 for the 4th day in a row: Open-source framework for integrating AI models and APIs directly with all major SQL databases
- Trending on GitHub top 10 for the 4th day in a row and official technology partner of MongoDB: Open-source framework for integrating AI with MongoDB and MongoDB Atlas
  Definitely check it out: https://github.com/SuperDuperDB/superduperdb and find it here: https://cloud.mongodb.com/ecosystem/
- Trending on GitHub top 10 globally for the 4th day in a row: Open-source framework for integrating OpenAI and GPT with major databases
  Build a chatbot with OpenAI: https://github.com/SuperDuperDB/superduperdb/blob/main/examples/question_the_docs.ipynb
- SuperDuperDB - how to use it to talk to your documents locally using llama 7B or Mistral 7B?
- Trending on GitHub globally 3 days in a row: SuperDuperDB, a framework for integrating AI with major databases (making them super-duper)
  It is for building AI (into your) apps easily without complex pipelines and make your database intelligent (including vector search), definitely check it out: https://github.com/SuperDuperDB/superduperdb
- 🔮 SuperDuperDB is #3 on GitHub Trending globally! 🥉
  VentureBeat already covered the launch. This is our website. This is our main GitHub repository.
What are some alternatives?
pix2struct
ds2 - Easiest way to use AI models without coding (Web UI & API support)
CascadeTabNet - This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
best-of-ml-python - 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
metaflow - :rocket: Build and manage real-life ML, AI, and data science projects with ease!
FLaNK-Ice - Apache Iceberg - Cloud Data Lakehouse
nyc_traffic_flask - Flask App with leaflet.js that can perform NYC Traffic Prediction
llama - Inference code for Llama models
Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials - A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
camelot - Camelot: PDF Table Extraction for Humans
mlops-python-package - Kickstart your MLOps initiative with a flexible, robust, and productive Python package.