VLDet Alternatives
Similar projects and alternatives to VLDet
-
CLIP-Caption-Reward
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
robo-vln
Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
-
OASIS
Official implementation of the paper "You Only Need Adversarial Supervision for Semantic Image Synthesis" (ICLR 2021) (by boschresearch)
-
VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
VLDet reviews and mentions
-
[R] [ICLR'2023🌟]: Vision-and-Language Framework for Open-Vocabulary Object Detection
We're excited to share our latest work "Learning Object-Language Alignments for Open-Vocabulary Object Detection", which got accepted to ICLR'2023. Here're some resources: arxiv paper: https://arxiv.org/abs/2211.14843 github: https://github.com/clin1223/VLDet The proposed method called **VLDet**, which is a a simple yet effective vision-and-language framework for open-vocabulary object detection. Our key efforts are: 🔥 We introduce an open-vocabulary object detector method to learn object-language alignments directly from image-text pair data. 🔥 We propose to formulate region-word alignments as a set-matching problem and solve it efficiently with the Hungarian algorithm. 🔥 We use all nouns from image-text pairs as our object voccabulary which is strictly following the open-vocabulary setting and extensive experiments on two benchmark datasets, COCO and LVIS, demonstrate our superior performance.
Stats
clin1223/VLDet is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.
The primary programming language of VLDet is Python.
Popular Comparisons
Sponsored