VLDet

[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843) (by clin1223)

VLDet Alternatives

Similar projects and alternatives to VLDet

  • CLIP-Caption-Reward

    PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

  • DeepKE

    2 VLDet VS DeepKE

    [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

  • robo-vln

    Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

  • yolov5

    129 VLDet VS yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

  • OASIS

    1 VLDet VS OASIS

    Official implementation of the paper "You Only Need Adversarial Supervision for Semantic Image Synthesis" (ICLR 2021) (by boschresearch)

  • VL_adapter

    PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)

  • mmdetection

    OpenMMLab Detection Toolbox and Benchmark

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better VLDet alternative or higher similarity.

VLDet reviews and mentions

Posts with mentions or reviews of VLDet. We have used some of these posts to build our list of alternatives and similar projects.
  • [R] [ICLR'2023🌟]: Vision-and-Language Framework for Open-Vocabulary Object Detection
    1 project | /r/MachineLearning | 11 Feb 2023
    We're excited to share our latest work, "Learning Object-Language Alignments for Open-Vocabulary Object Detection", which was accepted to ICLR 2023. Here are some resources: arXiv paper: https://arxiv.org/abs/2211.14843 GitHub: https://github.com/clin1223/VLDet
    The proposed method, called **VLDet**, is a simple yet effective vision-and-language framework for open-vocabulary object detection. Our key efforts are:
    🔥 We introduce an open-vocabulary object detection method that learns object-language alignments directly from image-text pair data.
    🔥 We formulate region-word alignment as a set-matching problem and solve it efficiently with the Hungarian algorithm.
    🔥 We use all nouns from the image-text pairs as our object vocabulary, which strictly follows the open-vocabulary setting. Extensive experiments on two benchmark datasets, COCO and LVIS, demonstrate our superior performance.
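The set-matching idea mentioned in the post can be illustrated with a small, self-contained sketch. This is not code from the VLDet repository: the function name `match_words_to_regions`, the example nouns, and the similarity scores are all hypothetical, and the real method works on learned region and word embeddings during training. The sketch only shows how the Hungarian algorithm picks a one-to-one word-to-region assignment that maximizes total similarity, assuming there are at least as many regions as words.

```python
def match_words_to_regions(similarity):
    """Hungarian algorithm (potentials variant, O(n^2 * m)).

    similarity[i][j] is a hypothetical score between word i and region j.
    Assumes len(similarity) <= len(similarity[0]) (no more words than regions).
    Returns a list where entry i is the region index assigned to word i.
    """
    n, m = len(similarity), len(similarity[0])
    # Maximizing similarity is the same as minimizing negated similarity.
    cost = [[-s for s in row] for row in similarity]
    inf = float("inf")
    u = [0.0] * (n + 1)   # row potentials (1-indexed)
    v = [0.0] * (m + 1)   # column potentials (1-indexed)
    p = [0] * (m + 1)     # p[j] = row currently matched to column j (0 = none)
    way = [0] * (m + 1)   # back-pointers along the augmenting path

    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0 = p[j0]
            delta, j1 = inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j] = cur
                        way[j] = j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            # Shift potentials so one more reduced-cost edge becomes tight.
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        # Flip the matching along the augmenting path.
        while j0 != 0:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1

    assignment = [-1] * n
    for j in range(1, m + 1):
        if p[j] != 0:
            assignment[p[j] - 1] = j - 1
    return assignment


# Made-up scores: two nouns from a caption, three candidate regions.
words = ["dog", "frisbee"]
similarity = [
    [0.9, 0.2, 0.1],  # "dog" vs. regions 0..2
    [0.8, 0.7, 0.1],  # "frisbee" vs. regions 0..2
]
assignment = match_words_to_regions(similarity)
print(dict(zip(words, assignment)))  # {'dog': 0, 'frisbee': 1}
```

Note that picking the best region for each word independently would assign region 0 to both nouns. The set-matching formulation forces a one-to-one assignment, so "frisbee" takes region 1 and the total similarity is 0.9 + 0.7.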

Stats

Basic VLDet repo stats
1
169
3.1
about 1 month ago

clin1223/VLDet is an open source project licensed under the GNU General Public License v3.0 or later, which is an OSI-approved license.

The primary programming language of VLDet is Python.
