MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding

MDETR:用于端到端多模态理解的调制检测.

Towards Open-Set Object Detection and Discovery

面向开集目标检测与挖掘.

Grounded Language-Image Pre-training

对齐语言-图像预训练.

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Grounding DINO:结合DINO与GLIP用于开集目标检测.

开放集合目标检测(Open-Set Object Detection)

Open-Set Object Detection.

Finite Scalar Quantization: VQ-VAE Made Simple

有限标量量化:简化向量量化的变分自编码器.