DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection

Learning Object-Language Alignments for Open-Vocabulary Object Detection

RegionCLIP: Region-based Language-Image Pretraining

Exploiting Unlabeled Data with Vision and Language Models for Object Detection

Analysis on Object Detection Datasets

Detecting Twenty-thousand Classes using Image-level Supervision