Yiwei Zhou
What's new? Analysing language-specific Wikipedia entity contexts to support entity-centric news retrieval
Zhou, Yiwei; Demidova, Elena; Cristea, A.I.
Authors
Contributors
N. Nguyen
Editor
R. Kowalczyk
Editor
A. Pinto
Editor
J. Cardoso
Editor
Abstract
Representation of influential entities, such as celebrities and multinational corporations on the web can vary across languages, re- flecting language-specific entity aspects, as well as divergent views on these entities in different communities. An important source of multilingual background knowledge about influential entities is Wikipedia — an online community-created encyclopaedia — containing more than 280 language editions. Such language-specific information could be applied in entity-centric information retrieval applications, in which users utilise very simple queries, mostly just the entity names, for the relevant documents. In this article we focus on the problem of creating languagespecific entity contexts to support entity-centric, language-specific information retrieval applications. First, we discuss alternative ways such contexts can be built, including Graph-based and Article-based approaches. Second, we analyse the similarities and the differences in these contexts in a case study including 220 entities and five Wikipedia language editions. Third, we propose a context-based entity-centric information retrieval model that maps documents to aspect space, and apply languagespecific entity contexts to perform query expansion. Last, we perform a case study to demonstrate the impact of this model in a news retrieval application. Our study illustrates that the proposed model can effectively improve the recall of entity-centric information retrieval while keeping high precision, and provide language-specific results.
Citation
Zhou, Y., Demidova, E., & Cristea, A. (2017). What's new? Analysing language-specific Wikipedia entity contexts to support entity-centric news retrieval. In N. Nguyen, R. Kowalczyk, A. Pinto, & J. Cardoso (Eds.), Transactions on Computational Collective Intelligence XXVI (2010-231). Springer Verlag. https://doi.org/10.1007/978-3-319-59268-8_10
Online Publication Date | Jun 15, 2017 |
---|---|
Publication Date | Jun 15, 2017 |
Deposit Date | Jul 11, 2018 |
Publicly Available Date | Jul 31, 2018 |
Publisher | Springer Verlag |
Pages | 2010-231 |
Series Title | Lecture notes in computer science |
Series Number | 10190 |
Book Title | Transactions on Computational Collective Intelligence XXVI. |
ISBN | 9783319592671 |
DOI | https://doi.org/10.1007/978-3-319-59268-8_10 |
Public URL | https://durham-repository.worktribe.com/output/1635452 |
Related Public URLs | http://wrap.warwick.ac.uk/85950/ |
Contract Date | Oct 7, 2017 |
Files
Accepted Book Chapter
(717 Kb)
PDF
Copyright Statement
The final publication is available at Springer via https://doi.org/10.1007/978-3-319-59268-8_10
You might also like
Editorial: New challenges and future perspectives in cognitive neuroscience
(2024)
Journal Article
Using deep learning to analyze the psychological effects of COVID-19
(2023)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search