Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese

Chinese CLIP:中文对比视觉语言预训练.

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

BLIP-2:使用冻结图像编码器和大语言模型的引导式语言-图像预训练.

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

BLIP:引导式语言-图像预训练实现统一的视觉-语言理解和生成.

GLIPv2: Unifying Localization and Vision-Language Understanding

GLIPv2:统一定位和视觉语言理解.

(河北篇)保定:推开京畿之门

(Hebei Chapter) Baoding: Opening the Door to the Capital Region.

微调 Grounding DINO 和 Label Studio 进行半自动化目标检测标注

Semi-automatic Object Detection Annotation with Fine-tuned Grounding DINO and Label Studio.