Skip to main content

Research Repository

Advanced Search

Outputs (663)

Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models (2021)
Journal Article
Bond-Taylor, S., Leach, A., Long, Y., & Willcocks, C. G. (2021). Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(11), 7327-7347. https://doi.org/10.1109/tpami.2021.3116668

Deep generative models are a class of techniques that train deep neural networks to model the distribution of training samples. Research has fragmented into various interconnected approaches, each of which make trade-offs including run-time, diversit... Read More about Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models.

Short-term effects of caffeine intake on binocular accommodative facility: a quantitative and qualitative analysis (2021)
Journal Article
Redondo, B., Vera, J., Koulieris, G., Molina-Romero, R., & Jiménez, R. (2022). Short-term effects of caffeine intake on binocular accommodative facility: a quantitative and qualitative analysis. Clinical and Experimental Optometry, 105(5), 534-538. https://doi.org/10.1080/08164622.2021.1935218

Clinical relevance: Caffeine intake has been demonstrated to influence several physiological measures, including some related to eye physiology. The ability to focus at different distances is of paramount importance in real-world situations, and thus... Read More about Short-term effects of caffeine intake on binocular accommodative facility: a quantitative and qualitative analysis.

Eye Tracking Interaction on Unmodified Mobile VR Headsets Using the Selfie Camera (2021)
Journal Article
Drakopoulos, P., Koulieris, G., & Mania, K. (2021). Eye Tracking Interaction on Unmodified Mobile VR Headsets Using the Selfie Camera. ACM Transactions on Applied Perception, 18(3), 1-20. https://doi.org/10.1145/3456875

Input methods for interaction in smartphone-based virtual and mixed reality (VR/MR) are currently based on uncomfortable head tracking controlling a pointer on the screen. User fixations are a fast and natural input method for VR/MR interaction. Prev... Read More about Eye Tracking Interaction on Unmodified Mobile VR Headsets Using the Selfie Camera.

Curvature-based feature selection with application in classifying electronic health records (2021)
Journal Article
Zuo, Z., Li, J., Xu, H., & Al Moubayed, N. (2021). Curvature-based feature selection with application in classifying electronic health records. Technological Forecasting and Social Change, 173, Article 121127. https://doi.org/10.1016/j.techfore.2021.121127

Disruptive technologies provides unparalleled opportunities to contribute to the identifications of many aspects in pervasive healthcare, from the adoption of the Internet of Things through to Machine Learning (ML) techniques. As a powerful tool, ML... Read More about Curvature-based feature selection with application in classifying electronic health records.

A Privacy-Preserving Efficient Location-Sharing Scheme for Mobile Online Social Network Applications (2020)
Journal Article
Bhattacharya, M., Roy, S., Mistry, K., Shum, H. P., & Chattopadhyay, S. (2020). A Privacy-Preserving Efficient Location-Sharing Scheme for Mobile Online Social Network Applications. IEEE Access, 8, https://doi.org/10.1109/access.2020.3043621

The rapid development of mobile internet technology and the better availability of GPS have made mobile online social networks (mOSNs) more popular than traditional online social networks (OSNs) over the last few years. They necessitate fundamental s... Read More about A Privacy-Preserving Efficient Location-Sharing Scheme for Mobile Online Social Network Applications.

Examining the Validity of a New Method for the Objective Assessment of Binocular Accommodative Facility (2Q-AF Test): A Comparison with ± 2.00 DS Lens Flippers (2021)
Journal Article
Vera, J., Redondo, B., Koulieris, G., Molina, R., & Jiménez, R. (2022). Examining the Validity of a New Method for the Objective Assessment of Binocular Accommodative Facility (2Q-AF Test): A Comparison with ± 2.00 DS Lens Flippers. Current Eye Research, 47(1), 62-68. https://doi.org/10.1080/02713683.2021.1962359

Purpose: Recent technological advances have permitted to objectively record the accommodative response while shifting between two different levels of accommodation. This study is aimed at examining the concurrent validity of a new objective method fo... Read More about Examining the Validity of a New Method for the Objective Assessment of Binocular Accommodative Facility (2Q-AF Test): A Comparison with ± 2.00 DS Lens Flippers.

Cross-Domain Structure Preserving Projection for Heterogeneous Domain Adaptation (2021)
Journal Article
Wang, Q., & Breckon, T. (2022). Cross-Domain Structure Preserving Projection for Heterogeneous Domain Adaptation. Pattern Recognition, 123, Article 108362. https://doi.org/10.1016/j.patcog.2021.108362

Heterogeneous Domain Adaptation (HDA) addresses the transfer learning problems where data from the source and target domains are of different modalities (e.g., texts and images) or feature dimensions (e.g., features extracted with different methods).... Read More about Cross-Domain Structure Preserving Projection for Heterogeneous Domain Adaptation.

Babbling brook to thunderous torrent: Using sound to monitor river stage (2021)
Journal Article
Osborne, W. A., Hodge, R. A., Love, G. D., Hawkin, P., & Hawkin, R. E. (2021). Babbling brook to thunderous torrent: Using sound to monitor river stage. Earth Surface Processes and Landforms, 46(13), 2656-2670. https://doi.org/10.1002/esp.5199

The passive, ambient sound above the water from a river has previously untapped potential for determining flow characteristics such as stage. Measuring sub-aerial sound could provide a new, efficient way to continuously monitor river stage, without t... Read More about Babbling brook to thunderous torrent: Using sound to monitor river stage.

Temporal and Non-Temporal Contextual Saliency Analysis for Generalized Wide-Area Search within Unmanned Aerial Vehicle (UAV) Video (2021)
Journal Article
Gökstorp, S., & Breckon, T. (2022). Temporal and Non-Temporal Contextual Saliency Analysis for Generalized Wide-Area Search within Unmanned Aerial Vehicle (UAV) Video. Visual Computer, 38(6), 2033-2040. https://doi.org/10.1007/s00371-021-02264-6

Unmanned Aerial Vehicles (UAV) can be used to great effect for wide-area searches such as search and rescue operations. UAV enable search and rescue teams to cover large areas more efficiently and in less time. However, using UAV for this purpose inv... Read More about Temporal and Non-Temporal Contextual Saliency Analysis for Generalized Wide-Area Search within Unmanned Aerial Vehicle (UAV) Video.

3D car shape reconstruction from a contour sketch using GAN and lazy learning (2021)
Journal Article
Nozawa, N., Shum, H. P., Feng, Q., Ho, E. S., & Morishima, S. (2022). 3D car shape reconstruction from a contour sketch using GAN and lazy learning. Visual Computer, 38(4), 1317-1330. https://doi.org/10.1007/s00371-020-02024-y

3D car models are heavily used in computer games, visual effects, and even automotive designs. As a result, producing such models with minimal labour costs is increasingly more important. To tackle the challenge, we propose a novel system to reconstr... Read More about 3D car shape reconstruction from a contour sketch using GAN and lazy learning.

Two-stage human verification using HandCAPTCHA and anti-spoofed finger biometrics with feature selection (2021)
Journal Article
Bera, A., Bhattacharjee, D., & Shum, H. P. (2021). Two-stage human verification using HandCAPTCHA and anti-spoofed finger biometrics with feature selection. Expert Systems with Applications, 171, https://doi.org/10.1016/j.eswa.2021.114583

This paper presents a human verification scheme in two independent stages to overcome the vulnerabilities of attacks and to enhance security. At the first stage, a hand image-based CAPTCHA (HandCAPTCHA) is tested to avert automated bot-attacks on the... Read More about Two-stage human verification using HandCAPTCHA and anti-spoofed finger biometrics with feature selection.

Spoofing Detection on Hand Images Using Quality Assessment (2021)
Journal Article
Bera, A., Dey, R., Bhattacharjee, D., Nasipuri, M. *., & Shum, H. (2021). Spoofing Detection on Hand Images Using Quality Assessment. Multimedia Tools and Applications, 80(19), 28603-28626. https://doi.org/10.1007/s11042-021-10976-z

Recent research on biometrics focuses on achieving a high success rate of authentication and addressing the concern of various spoofing attacks. Although hand geometry recognition provides adequate security over unauthorized access, it is susceptible... Read More about Spoofing Detection on Hand Images Using Quality Assessment.

A plug-in attribute correction module for generalized zero-shot learning (2020)
Journal Article
Zhang, H., Bai, H., Long, Y., Liu, L., & Shao, L. (2021). A plug-in attribute correction module for generalized zero-shot learning. Pattern Recognition, 112, Article 107767. https://doi.org/10.1016/j.patcog.2020.107767

While Zero Shot Learning models can recognize new classes without training examples, they often fails to incorporate both seen and unseen classes together at the test time, which is known as the Generalized Zero-shot Learning (GZSL) problem. This pap... Read More about A plug-in attribute correction module for generalized zero-shot learning.

Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms with Electronic Health Records (2021)
Journal Article
Alhassan, Z., Watson, M., Budgen, D., Alshammari, R., Alessa, A., & Al Moubayed, N. (2021). Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms with Electronic Health Records. JMIR Medical Informatics, 9(5), Article e25237. https://doi.org/10.2196/25237

Background: Predicting the risk of glycated hemoglobin (HbA1c) elevation can help identify patients with the potential for developing serious chronic health problems such as diabetes. Early preventive interventions based upon advanced predictive mode... Read More about Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms with Electronic Health Records.

Facial reshaping operator for controllable face beautification (2020)
Journal Article
Hu, S., Shum, H. P., Liang, X., Li, F. W., & Aslam, N. (2021). Facial reshaping operator for controllable face beautification. Expert Systems with Applications, 167, Article 114067. https://doi.org/10.1016/j.eswa.2020.114067

Posting attractive facial photos is part of everyday life in the social media era. Motivated by the demand, we propose a lightweight method to automatically and efficiently beautify the shapes of both portrait and non-portrait faces in photos, while... Read More about Facial reshaping operator for controllable face beautification.

Multi-task Deep Learning with Optical Flow Features for Self-Driving Cars (2020)
Journal Article
Hu, Y., Shum, H. P., & Ho, E. S. (2020). Multi-task Deep Learning with Optical Flow Features for Self-Driving Cars. IET Intelligent Transport Systems, 14(13), 1845-1854. https://doi.org/10.1049/iet-its.2020.0439

The control of self-driving cars has received growing attention recently. Although existing research shows promising results in the vehicle control using video from a monocular dash camera, there has been very limited work on directly learning vehicl... Read More about Multi-task Deep Learning with Optical Flow Features for Self-Driving Cars.

Real-Time Posture Reconstruction for Microsoft Kinect (2013)
Journal Article
Shum, H. P., Ho, E. S., Jiang, Y., & Takagi, S. (2013). Real-Time Posture Reconstruction for Microsoft Kinect. IEEE Transactions on Cybernetics, 43(5), 1357-1369. https://doi.org/10.1109/tcyb.2013.2275945

The recent advancement of motion recognition using Microsoft Kinect stimulates many new ideas in motion capture and virtual reality applications. Utilizing a pattern recognition algorithm, Kinect can determine the positions of different body parts fr... Read More about Real-Time Posture Reconstruction for Microsoft Kinect.

Kinect Posture Reconstruction Based on a Local Mixture of Gaussian Process Models (2015)
Journal Article
Liu, Z., Zhou, L., Leung, H., & Shum, H. P. (2016). Kinect Posture Reconstruction Based on a Local Mixture of Gaussian Process Models. IEEE Transactions on Visualization and Computer Graphics, 22(11), 2437-2450. https://doi.org/10.1109/tvcg.2015.2510000

Depth sensor based 3D human motion estimation hardware such as Kinect has made interactive applications more popular recently. However, it is still challenging to accurately recognize postures from a single depth camera due to the inherently noisy da... Read More about Kinect Posture Reconstruction Based on a Local Mixture of Gaussian Process Models.

Automatic Musculoskeletal and Neurological Disorder Diagnosis With Relative Joint Displacement From Human Gait (2018)
Journal Article
Rueangsirarak, W., Zhang, J., Aslam, N., Ho, E. S., & Shum, H. P. (2018). Automatic Musculoskeletal and Neurological Disorder Diagnosis With Relative Joint Displacement From Human Gait. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 26(12), 2387-2396. https://doi.org/10.1109/tnsre.2018.2880871

Musculoskeletal and neurological disorders are common devastating companions of ageing, leading to a reduction in quality of life and increased mortality. Gait analysis is a popular method for diagnosing these disorders. However, manually analyzing t... Read More about Automatic Musculoskeletal and Neurological Disorder Diagnosis With Relative Joint Displacement From Human Gait.