Open-vocabulary Object Detection via Vision and Language Knowledge Distillation 通过视觉和语言知识蒸馏实现开放词汇目标检测.
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection Grounding DINO:结合DINO与GLIP用于开集目标检测.