DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

A long-context language model for deciphering and generating bacteriophage genomes

GENA-LM: A Family of Open-Source Foundational Models for Long DNA Sequences

The Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling

DNA language models are powerful zero-shot predictors of non-coding variant effects
