Top 3 Python vision-language-transformer Projects
-
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
-
APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception (by shenyunhang)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
UPop
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Some of the foundation/base models include: * GroundedSAM (Segment Anything Model) * DETIC * GroundingDINO
https://github.com/shenyunhang/APE (super new, idk usability on this one yet)
Project mention: Show HN: Compress vision-language and unimodal AI models by structured pruning | news.ycombinator.com | 2023-07-31
Python vision-language-transformer related posts
Index
What are some of the best open-source vision-language-transformer projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | GroundingDINO | 5,075 |
2 | APE | 424 |
3 | UPop | 82 |
Sponsored