A long-context language model for deciphering and generating bacteriophage genomes

用于解码和生成噬菌体基因组的长上下文语言模型.

GENA-LM: A Family of Open-Source Foundational Models for Long DNA Sequences

GENA-LM:长DNA序列的开源基础模型家族.

The Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

Nucleotide Transformer:为人类基因组建立并评估鲁棒的基础模型.

Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling

Caduceus:双向等效长范围DNA序列建模.

DNA language models are powerful zero-shot predictors of non-coding variant effects

DNA语言模型是非编码变异效应的强大零样本预测器.

DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome

DNABERT-2:多物种基因组的高效基础模型和基准.