On Support Samples of Next Word Prediction

Publication date

2025

Authors

Li, Yuqian
Du, Yupei (ISNI 0000000493058809)
Liu, Yufang
Feng, Feifei
Feng, Mou Xiao
Wu, Yuanbin

Editors

Che, Wanxiang
Nabende, Joyce
Shutova, Ekaterina
Pilehvar, Mohammad Taher

Document Type

Part of book
Open Access

License

CC BY

Abstract

Language models excel in various tasks by making complex decisions, yet understanding the rationale behind these decisions remains a challenge. This paper investigates data-centric interpretability in language models, focusing on the next-word prediction task. Using the representer theorem, we identify two types of support samples: those that promote specific predictions and those that deter them. Our findings reveal that being a support sample is an intrinsic property, predictable even before training begins. Additionally, while non-support samples are less influential in direct predictions, they play a critical role in preventing overfitting and in shaping generalization and representation learning. Notably, the importance of non-support samples increases in deeper layers, suggesting a significant role in intermediate representation formation. These insights shed light on the interplay between data and model decisions, offering a new dimension to understanding language model behavior and interpretability.
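
The abstract names the representer theorem without stating its form. As a rough illustration only, and not the authors' implementation, the sketch below uses the standard representer decomposition for an L2-regularized softmax output layer: at a stationary point, each logit of a test input can be written as a sum of per-training-sample contributions alpha_i * (f_i . f_test). The toy data, the regularization strength `lam`, and the helper names `fit_last_layer` and `representer_values` are all assumptions made for this example; the sign-based split into "promoting" and "deterring" samples is one plausible reading of the abstract, not the paper's definition of a support sample.

```python
import numpy as np

def softmax(z):
    # Row-wise softmax with the usual max-shift for numerical stability.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fit_last_layer(F, Y, lam, lr=0.1, steps=5000):
    # Gradient descent on mean cross-entropy + lam * ||W||^2 over fixed features F.
    # The decomposition below holds only at (near) a stationary point of this loss.
    n, d = F.shape
    W = np.zeros((Y.shape[1], d))
    for _ in range(steps):
        P = softmax(F @ W.T)
        grad = (P - Y).T @ F / n + 2 * lam * W
        W -= lr * grad
    return W

def representer_values(F, Y, W, lam):
    # Setting the gradient to zero gives W = sum_i alpha_i f_i^T
    # with alpha_i = (y_i - p_i) / (2 * lam * n).
    n = F.shape[0]
    P = softmax(F @ W.T)
    return (Y - P) / (2 * lam * n)

# Hypothetical toy data: 40 samples, 5-dim features, 3 "next-word" classes.
rng = np.random.default_rng(0)
n, d, C, lam = 40, 5, 3, 0.1
F = rng.normal(size=(n, d))
Y = np.eye(C)[rng.integers(0, C, size=n)]

W = fit_last_layer(F, Y, lam)
alpha = representer_values(F, Y, W, lam)

f_test = rng.normal(size=d)
contrib = alpha * (F @ f_test)[:, None]   # (n, C): sample i's share of each logit
logits_from_samples = contrib.sum(axis=0)
logits_direct = W @ f_test

pred = int(np.argmax(logits_direct))
promoting = np.flatnonzero(contrib[:, pred] > 0)  # push the predicted logit up
deterring = np.flatnonzero(contrib[:, pred] < 0)  # push the predicted logit down
```

In this toy setting, samples whose contribution is close to zero for every class would be the natural candidates for "non-support" samples, though where to draw that threshold is left open here.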

Keywords

Language and Linguistics, Linguistics and Language, Computer Science Applications

Citation

Li, Y, Du, Y, Liu, Y, Feng, F, Feng, M X & Wu, Y 2025, On Support Samples of Next Word Prediction. in W Che, J Nabende, E Shutova & M T Pilehvar (eds), Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Proceedings of the Annual Meeting of the Association for Computational Linguistics, vol. 1, Association for Computational Linguistics (ACL), pp. 10277-10289, 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025, Vienna, Austria, 27/07/25. https://doi.org/10.18653/v1/2025.acl-long.507