Skip to main content

Research Repository

Advanced Search

Curvature-based feature selection with application in classifying electronic health records

Zuo, Zheming; Li, Jie; Xu, Han; Al Moubayed, Noura

Authors

Zheming Zuo

Jie Li

Han Xu



Abstract

Disruptive technologies provides unparalleled opportunities to contribute to the identifications of many aspects in pervasive healthcare, from the adoption of the Internet of Things through to Machine Learning (ML) techniques. As a powerful tool, ML has been widely applied in patient-centric healthcare solutions. To further improve the quality of patient care, Electronic Health Records (EHRs) are commonly adopted in healthcare facilities for analysis. It is a crucial task to apply AI and ML to analyse those EHRs for prediction and diagnostics due to their highly unstructured, unbalanced, incomplete, and high-dimensional nature. Dimensionality reduction is a common data preprocessing technique to cope with high-dimensional EHR data, which aims to reduce the number of features of EHR representation while improving the performance of the subsequent data analysis, e.g. classification. In this work, an efficient filter-based feature selection method, namely Curvature-based Feature Selection (CFS), is presented. The proposed CFS applied the concept of Menger Curvature to rank the weights of all features in the given data set. The performance of the proposed CFS has been evaluated in four well-known EHR data sets, including Cervical Cancer Risk Factors (CCRFDS), Breast Cancer Coimbra (BCCDS), Breast Tissue (BTDS), and Diabetic Retinopathy Debrecen (DRDDS). The experimental results show that the proposed CFS achieved state-of-the-art performance on the above data sets against conventional PCA and other most recent approaches. The source code of the proposed approach is publicly available at https://github.com/zhemingzuo/CFS.

Citation

Zuo, Z., Li, J., Xu, H., & Al Moubayed, N. (2021). Curvature-based feature selection with application in classifying electronic health records. Technological Forecasting and Social Change, 173, Article 121127. https://doi.org/10.1016/j.techfore.2021.121127

Journal Article Type Article
Acceptance Date Aug 14, 2021
Online Publication Date Sep 8, 2021
Publication Date 2021-12
Deposit Date Sep 15, 2021
Journal Journal of Technological Forecasting and Social Change
Print ISSN 0040-1625
Electronic ISSN 1873-5509
Publisher Elsevier
Peer Reviewed Peer Reviewed
Volume 173
Article Number 121127
DOI https://doi.org/10.1016/j.techfore.2021.121127
Public URL https://durham-repository.worktribe.com/output/1234443
Related Public URLs https://www.sciencedirect.com/science/article/pii/S0040162521005606?CMX_ID=&SIS_ID=&dgcid=STMJ_AUTH_SERV_PUBLISHED&utm_acid=144050121&utm_campaign=STMJ_AUTH_SERV_PUBLISHED&utm_in=DM180567&utm_medium=email&utm_source=AC_