Grounded Language-Image Pre-training

Aligned language-image pre-training.

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Grounding DINO: combining DINO with GLIP for open-set object detection.

Open-Set Object Detection

Open-Set Object Detection.

Finite Scalar Quantization: VQ-VAE Made Simple

Finite scalar quantization: simplifying the vector-quantized variational autoencoder.

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

A blind counter for exemplar-free, multi-class, class-agnostic counting.

Effective Whole-body Pose Estimation with Two-stages Distillation

Effective whole-body pose estimation via two-stage distillation.