pytorch_mgie
VLDet
pytorch_mgie | VLDet | |
---|---|---|
1 | 1 | |
326 | 170 | |
- | - | |
2.6 | 3.1 | |
3 months ago | 2 months ago | |
Python | Python | |
- | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pytorch_mgie
VLDet
-
[R] [ICLR'2023🌟]: Vision-and-Language Framework for Open-Vocabulary Object Detection
We're excited to share our latest work "Learning Object-Language Alignments for Open-Vocabulary Object Detection", which got accepted to ICLR'2023. Here're some resources: arxiv paper: https://arxiv.org/abs/2211.14843 github: https://github.com/clin1223/VLDet The proposed method called **VLDet**, which is a a simple yet effective vision-and-language framework for open-vocabulary object detection. Our key efforts are: 🔥 We introduce an open-vocabulary object detector method to learn object-language alignments directly from image-text pair data. 🔥 We propose to formulate region-word alignments as a set-matching problem and solve it efficiently with the Hungarian algorithm. 🔥 We use all nouns from image-text pairs as our object voccabulary which is strictly following the open-vocabulary setting and extensive experiments on two benchmark datasets, COCO and LVIS, demonstrate our superior performance.
What are some alternatives?
CLIP-Caption-Reward - PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
DeepKE - [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
robo-vln - Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
VL_adapter - PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
DDNM - [ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
OASIS - Official implementation of the paper "You Only Need Adversarial Supervision for Semantic Image Synthesis" (ICLR 2021)
mmdetection - OpenMMLab Detection Toolbox and Benchmark
yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite