Skip to main content

Research Repository

Advanced Search

Outputs (16)

Tackling Data Bias in Painting Classification with Style Transfer (2023)
Presentation / Conference Contribution
Vijendran, M., Li, F. W., & Shum, H. P. (2023, February). Tackling Data Bias in Painting Classification with Style Transfer. Presented at VISAPP '23: 2023 International Conference on Computer Vision Theory and Applications, Lisbon, Portugal

It is difficult to train classifiers on paintings collections due to model bias from domain gaps and data bias from the uneven distribution of artistic styles. Previous techniques like data distillation, traditional data augmentation and style transf... Read More about Tackling Data Bias in Painting Classification with Style Transfer.

Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation (2023)
Presentation / Conference Contribution
Li, L., Shum, H. P., & Breckon, T. P. (2023, June). Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation. Presented at 2023 IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), Vancouver, BC

Whilst the availability of 3D LiDAR point cloud data has significantly grown in recent years, annotation remains expensive and time-consuming, leading to a demand for semisupervised semantic segmentation methods with application domains such as auton... Read More about Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation.

Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance Imagery (2023)
Presentation / Conference Contribution
Gaus, Y., Bhowmik, N., Issac-Medina, B., Atapour-Abarghouei, A., Shum, H., & Breckon, T. (2023, June). Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance Imagery. Presented at IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023, Vancouver, BC

Anomaly detection is a classical problem within automated visual surveillance, namely the determination of the normal from the abnormal when operational data availability is highly biased towards one class (normal) due to both insufficient sample siz... Read More about Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance Imagery.

A Video-Based Augmented Reality System for Human-in-the-Loop Muscle Strength Assessment of Juvenile Dermatomyositis (2023)
Journal Article
Zhou, K., Cai, R., Ma, Y., Tan, Q., Wang, X., Li, J., Shum, H. P., Li, F. W., Jin, S., & Liang, X. (2023). A Video-Based Augmented Reality System for Human-in-the-Loop Muscle Strength Assessment of Juvenile Dermatomyositis. IEEE Transactions on Visualization and Computer Graphics, 29(5), 2456-2466. https://doi.org/10.1109/tvcg.2023.3247092

As the most common idiopathic inflammatory myopathy in children, juvenile dermatomyositis (JDM) is characterized by skin rashes and muscle weakness. The childhood myositis assessment scale (CMAS) is commonly used to measure the degree of muscle invol... Read More about A Video-Based Augmented Reality System for Human-in-the-Loop Muscle Strength Assessment of Juvenile Dermatomyositis.

UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-Identification in Video Imagery (2022)
Presentation / Conference Contribution
Organisciak, D., Poyser, M., Alsehaim, A., Hu, S., Isaac-Medina, B. K., Breckon, T. P., & Shum, H. P. UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-Identification in Video Imagery. Presented at 2022 17th International Conference on Computer Vision Theory and Applications

As unmanned aerial vehicles (UAV) become more accessible with a growing range of applications, the risk of UAV disruption increases. Recent development in deep learning allows vision-based counter-UAV systems to detect and track UAVs with a single ca... Read More about UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-Identification in Video Imagery.

Towards Graph Representation Learning Based Surgical Workflow Anticipation (2022)
Presentation / Conference Contribution
Zhang, X., Al Moubayed, N., & Shum, H. P. (2022, September). Towards Graph Representation Learning Based Surgical Workflow Anticipation. Presented at 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), Ioannina, Greece

Surgical workflow anticipation can give predictions on what steps to conduct or what instruments to use next, which is an essential part of the computer-assisted intervention system for surgery, e.g. workflow reasoning in robotic surgery. However, cu... Read More about Towards Graph Representation Learning Based Surgical Workflow Anticipation.

Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos (2022)
Presentation / Conference Contribution
Qiao, T., Men, Q., Li, F. W., Kubotani, Y., Morishima, S., & Shum, H. P. (2022, October). Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos. Presented at Computer Vision - ECCV 2022, Tel Aviv, Israel

Human-Object Interaction (HOI) recognition in videos is important for analysing human activity. Most existing work focusing on visual features usually suffer from occlusion in the real-world scenarios. Such a problem will be further complicated when... Read More about Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos.

MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray (2022)
Presentation / Conference Contribution
Corona-Figueroa, A., Frawley, J., Bond-Taylor, S., Bethapudi, S., Shum, H. P., & Willcocks, C. G. (2022, July). MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray. Presented at 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Glasgow, Scotland

Computed tomography (CT) is an effective med-ical imaging modality, widely used in the field of clinical medicine for the diagnosis of various pathologies. Advances in Multidetector CT imaging technology have enabled additional functionalities, inclu... Read More about MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray.

DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications (2021)
Presentation / Conference Contribution
Li, L., Ismail, K. N., Shum, H. P., & Breckon, T. P. (2021, December). DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications. Presented at International Conference on 3D Vision, Surrey / Online

We present DurLAR, a high-fidelity 128-channel 3D LiDAR dataset with panoramic ambient (near infrared) and reflectivity imagery, as well as a sample benchmark task using depth estimation for autonomous driving applications. Our driving platform is eq... Read More about DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications.

Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark (2021)
Presentation / Conference Contribution
Isaac-Medina, B. K., Poyser, M., Organisciak, D., Willcocks, C. G., Breckon, T. P., & Shum, H. P. (2021, October). Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark. Presented at 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada

Unmanned Aerial Vehicles (UAV) can pose a major risk for aviation safety, due to both negligent and malicious use. For this reason, the automated detection and tracking of UAV is a fundamental task in aerial security systems. Common technologies for... Read More about Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark.

STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising (2021)
Presentation / Conference Contribution
Zhou, K., Cheng, Z., Shum, H. P., Li, F. W., & Liang, X. (2021, October). STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising. Presented at 2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Bari, Italy

Hand object interaction in mixed reality (MR) relies on the accurate tracking and estimation of human hands, which provide users with a sense of immersion. However, raw captured hand motion data always contains errors such as joints occlusion, disloc... Read More about STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising.

Facial reshaping operator for controllable face beautification (2020)
Journal Article
Hu, S., Shum, H. P., Liang, X., Li, F. W., & Aslam, N. (2021). Facial reshaping operator for controllable face beautification. Expert Systems with Applications, 167, Article 114067. https://doi.org/10.1016/j.eswa.2020.114067

Posting attractive facial photos is part of everyday life in the social media era. Motivated by the demand, we propose a lightweight method to automatically and efficiently beautify the shapes of both portrait and non-portrait faces in photos, while... Read More about Facial reshaping operator for controllable face beautification.

Cumuliform Cloud Formation Control using Parameter-Predicting Convolutional Neural Network (2020)
Journal Article
Zhang, Z., Ma, Y., Li, Y., Li, F. W., Shum, H. P., Yang, B., Guo, J., & Liang, X. (2020). Cumuliform Cloud Formation Control using Parameter-Predicting Convolutional Neural Network. Graphical Models, 111, Article 101083. https://doi.org/10.1016/j.gmod.2020.101083

Physically-based cloud simulation is an effective approach for synthesizing realistic cloud. However, generating clouds with desired shapes requires a time-consuming process for selecting the appropriate simulation parameters. This paper addresses su... Read More about Cumuliform Cloud Formation Control using Parameter-Predicting Convolutional Neural Network.

Sparse Metric-based Mesh Saliency (2020)
Journal Article
Hu, S., Liang, X., Shum, H. P., Li, F. W., & Aslam, N. (2020). Sparse Metric-based Mesh Saliency. Neurocomputing, 400, 11-23. https://doi.org/10.1016/j.neucom.2020.02.106

In this paper, we propose an accurate and robust approach to salient region detection for 3D polygonal surface meshes. The salient regions of a mesh are those that geometrically stand out from their contexts and therefore are semantically important f... Read More about Sparse Metric-based Mesh Saliency.

A Unified Deep Metric Representation for Mesh Saliency Detection and Non-rigid Shape Matching (2019)
Journal Article
Hu, S., Shum, H., Aslam, N., Li, F. W., & Liang, X. (2020). A Unified Deep Metric Representation for Mesh Saliency Detection and Non-rigid Shape Matching. IEEE Transactions on Multimedia, 22(9), 2278-2292. https://doi.org/10.1109/tmm.2019.2952983

In this paper, we propose a deep metric for unifying the representation of mesh saliency detection and non-rigid shape matching. While saliency detection and shape matching are two closely related and fundamental tasks in shape analysis, previous met... Read More about A Unified Deep Metric Representation for Mesh Saliency Detection and Non-rigid Shape Matching.