ChengHao Xiao chenghao.xiao@durham.ac.uk
PGR Student Doctor of Philosophy
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
Xiao, Chenghao; Chan, Hou Pong; Zhang, Hao; Aljunied, Mahani; Bing, Lidong; Al Moubayed, Noura; Rong, Yu
Authors
Hou Pong Chan
Hao Zhang
Mahani Aljunied
Lidong Bing
Dr Noura Al Moubayed noura.al-moubayed@durham.ac.uk
Associate Professor
Yu Rong
Abstract
While understanding the knowledge boundaries of LLMs is crucial to prevent hallucination, research on the knowledge boundaries of LLMs has predominantly focused on English. In this work, we present the first study to analyze how LLMs recognize knowledge boundaries across different languages by probing their internal representations when processing known and unknown questions in multiple languages. Our empirical studies reveal three key findings: 1) LLMs' perceptions of knowledge boundaries are encoded in the middle to middle-upper layers across different languages. 2) Language differences in knowledge boundary perception follow a linear structure, which motivates our proposal of a training-free alignment method that effectively transfers knowledge boundary perception ability across languages, thereby helping reduce hallucination risk in low-resource languages; 3) Fine-tuning on bilingual question pair translation further enhances LLMs' recognition of knowledge boundaries across languages. Given the absence of standard testbeds for cross-lingual knowledge boundary analysis, we construct a multilingual evaluation suite comprising three representative types of knowledge boundary data. Our code and datasets are publicly available at https://github.com/DAMO-NLP-SG/ LLM-Multilingual-Knowledge-Boundaries.
Citation
Xiao, C., Chan, H. P., Zhang, H., Aljunied, M., Bing, L., Al Moubayed, N., & Rong, Y. (2025, July). Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations. Presented at Annual Meeting of the Association for Computational Linguistics (ACL), Vienna, Austria
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | Annual Meeting of the Association for Computational Linguistics (ACL) |
Start Date | Jul 27, 2025 |
End Date | Aug 1, 2025 |
Acceptance Date | Jun 2, 2025 |
Deposit Date | Jun 18, 2025 |
Peer Reviewed | Peer Reviewed |
Public URL | https://durham-repository.worktribe.com/output/4106988 |
Publisher URL | https://aclanthology.org/events/acl-2024/ |
This file is under embargo due to copyright reasons.
You might also like
Fine-grained Main Ideas Extraction and Clustering of Online Course Reviews
(2022)
Book Chapter
Length is a Curse and a Blessing for Document-level Semantics
(2023)
Presentation / Conference Contribution
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
(2024)
Presentation / Conference Contribution
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search