郑之杰的个人博客
欢迎光临
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
超越模仿游戏:量化和推断语言模型的能力.
Training Compute-Optimal Large Language Models
训练计算最优的大型语言模型.
Locating and Editing Factual Associations in GPT
定位和编辑GPT中的事实关联.
Modifying Memories in Transformer Models
修正Transformer模型中的记忆.
Towards TracIng Factual Knowledge in Language Models Back to the Training Data
将语言模型中的事实知识追溯到训练数据.
On the Role of Bidirectionality in Language Model Pre-Training
探讨语言模型预训练中的双向性.