Jamba: A Hybrid Transformer-Mamba Language Model

Jamba: a hybrid language model combining Transformer and Mamba.

LoRA-GA: Low-Rank Adaptation with Gradient Approximation

LoRA-GA: parameter-efficient low-rank fine-tuning with gradient approximation.

VMamba: Visual State Space Model

VMamba: a visual state space model.

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Vision Mamba: efficient visual representation learning using a bidirectional state space model.

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

MoE-Mamba: efficient selective state space models through a mixture of experts.