Grounded Language-Image Pre-training

对齐语言-图像预训练.

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Grounding DINO:结合DINO与GLIP用于开集目标检测.

开放集合目标检测(Open-Set Object Detection)

Open-Set Object Detection.

菏泽一中百廿华诞赋

Heze No.1 Middle School’s 20th Birthday Ode.

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

PixArt-α: 真实文本到图像合成的扩散Transformer的快速训练.

Finite Scalar Quantization: VQ-VAE Made Simple

有限标量量化:简化向量量化的变分自编码器.