Exploiting Unlabeled Data with Vision and Language Models for Object Detection

通过视觉和语言模型探索目标检测中的无标签数据.

目标检测数据集的分析

Analysis on Object Detection Datasets.

Detecting Twenty-thousand Classes using Image-level Supervision

使用图像级监督检测两万个类别.

Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

通过伪边界框标签实现开放词汇目标检测.

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation

通过视觉和语言知识蒸馏实现开放词汇目标检测.

Open-Vocabulary Object Detection Using Captions

使用描述进行开集目标检测.