Exploiting Unlabeled Data with Vision and Language Models for Object Detection.
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation.