Unifying Nonlocal Blocks for Neural Networks

统一神经网络的非局部模块.

Attention Augmented Convolutional Networks

注意力增强卷积网络.

Dynamic Task Prioritization for Multitask Learning

多任务学习中的动态任务优先级.

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

使用稀疏门控的混合专家系统构建超大规模神经网络.

Region-based Non-local Operation for Video Classification

为视频分类设计的基于区域的非局部网络.

Exploring Self-attention for Image Recognition

探索图像识别的自注意力机制.