Fine-grained Main Ideas Extraction and Clustering of Online Course Reviews

Xiao, Chenghao; Shi, Lei; Cristea, Alexandra; Li, Zhaoxing; Pan, Ziqi

doi:10.1007/978-3-031-11644-5_24

Fine-grained Main Ideas Extraction and Clustering of Online Course Reviews

Xiao, Chenghao; Shi, Lei; Cristea, Alexandra; Li, Zhaoxing; Pan, Ziqi

Authors

ChengHao Xiao chenghao.xiao@durham.ac.uk
Demonstrator (Ptt)

Lei Shi

Professor Alexandra Cristea alexandra.i.cristea@durham.ac.uk
Professor

Zhaoxing Li zhaoxing.li2@durham.ac.uk
PGR Student Doctor of Philosophy

Ziqi Pan ziqi.pan2@durham.ac.uk
PGR Student Doctor of Philosophy

Contributors

M.M. Rodrigo
Editor

N. Matsuda
Editor

Professor Alexandra Cristea alexandra.i.cristea@durham.ac.uk
Editor

V. Dimitrova
Editor

Abstract

Online course reviews have been an essential way in which course providers could get insights into students’ perceptions about the course quality, especially in the context of massive open online courses (MOOCs), where it is hard for both parties to get further interaction. Analyzing online course reviews is thus an inevitable part for course providers towards the improvement of course quality and the structuring of future courses. However, reading through the often-time thousands of comments and extracting key ideas is not efficient and will potentially incur non-coverage of some important ideas. In this work, we propose a key idea extractor that is based on fine-grained aspect-level semantic units from comments, powered by different variations of state-of-the-art pre-trained language models (PLMs). Our approach differs from both previous topic modeling and keyword extraction methods, which lies in: First, we aim to not only eliminate the heavy reliance on human intervention and statistical characteristics that traditional topic models like LDA are based on, but also to overcome the coarse granularity of state-of-the-art topic models like top2vec. Second, different from previous keyword extraction methods, we do not extract keywords to summarize each comment, which we argue is not necessarily helpful for human readers to grasp key ideas at the course level. Instead, we cluster the ideas and concerns that have been most expressed throughout the whole course, without relying on the verbatimness of students’ wording. We show that this method provides high and stable coverage of students’ ideas.

Citation

Xiao, C., Shi, L., Cristea, A., Li, Z., & Pan, Z. (2022). Fine-grained Main Ideas Extraction and Clustering of Online Course Reviews. In M. Rodrigo, N. Matsuda, A. Cristea, & V. Dimitrova (Eds.), Artificial Intelligence in Education (294-306). Springer, Cham. https://doi.org/10.1007/978-3-031-11644-5_24

Online Publication Date	Jul 27, 2022
Publication Date	2022
Deposit Date	Aug 31, 2022
Publicly Available Date	Jul 28, 2023
Pages	294-306
Series Title	Lecture Notes in Computer Science
Series Number	13355
Book Title	Artificial Intelligence in Education
ISBN	978-3-031-11643-8
DOI	https://doi.org/10.1007/978-3-031-11644-5_24
Public URL	https://durham-repository.worktribe.com/output/1644470
Contract Date	Apr 25, 2022