Probing Language Models from A Human Behavioral Perspective

doi:10.48550/arXiv.2310.05216

PSYCH OpenIR

	Probing Language Models from A Human Behavioral Perspective
	Wang, Xintong 1; Li, Xiaoyu 2; Li, Xingshan3 ; Biemann, Chris 1
摘要	Large Language Models (LLMs) have emerged as dominant foundational models in modern NLP. However, the understanding of their prediction process and internal mechanisms, such as feed-forward networks and multi-head self-attention, remains largely unexplored. In this study, we probe LLMs from a human behavioral perspective, correlating values from LLMs with eye-tracking measures, which are widely recognized as meaningful indicators of reading patterns. Our findings reveal that LLMs exhibit a prediction pattern distinct from that of RNN-based LMs. Moreover, with the escalation of FFN layers, the capacity for memorization and linguistic knowledge encoding also surges until it peaks, subsequently pivoting to focus on comprehension capacity. The functions of self-attention are distributed across multiple heads. Lastly, we scrutinize the gate mechanisms, finding that they control the flow of information, with some gates promoting, while others eliminating information.
	2023
语种	英语
DOI	10.48550/arXiv.2310.05216
发表期刊	arXiv
期刊论文类型	综述
收录类别	EI
引用统计
文献类型	期刊论文
条目标识符	https://ir.psych.ac.cn/handle/311026/46208
专题	中国科学院心理研究所
作者单位	1.Department of Informatics, Universität Hamburg, Germany 2.Institute of Psychology, Chinese Academy of Sciences, China 3.Department of Informatics, Technische Universität Berlin, Germany
推荐引用方式 GB/T 7714	Wang, Xintong,Li, Xiaoyu,Li, Xingshan,et al. Probing Language Models from A Human Behavioral Perspective[J]. arXiv,2023.
APA	Wang, Xintong,Li, Xiaoyu,Li, Xingshan,&Biemann, Chris.(2023).Probing Language Models from A Human Behavioral Perspective.arXiv.
MLA	Wang, Xintong,et al."Probing Language Models from A Human Behavioral Perspective".arXiv (2023).