Systems and Algorithms for Convolutional Multi-Hybrid Language Models at Scale

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

The Curse of Depth in Large Language Models

Popular Science Notes: Unleash Your Imagination! Light Circling the Earth

The GAN is dead; long live the GAN! A Modern GAN Baseline

Simple Hardware-Efficient Long Convolutions for Sequence Modeling
