Vision, Imaging and Visualisation in Durham (VIViD)

Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models (2023)
Conference Proceeding
Chang, Z., Findlay, E. J., Zhang, H., & Shum, H. P. (2023). Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2023) - GRAPP (64-74). https://doi.org/10.5220/0011631000003417

Generating realistic motions for digital humans is a core but challenging part of computer animations and games, as human motions are both diverse in content and rich in styles. While the latest deep learning approaches have made significant advancem...

Tackling Data Bias in Painting Classification with Style Transfer (2023)
Conference Proceeding
Vijendran, M., Li, F. W., & Shum, H. P. (2023). Tackling Data Bias in Painting Classification with Style Transfer. In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP (250-261). https://doi.org/10.5220/0011776600003417

It is difficult to train classifiers on paintings collections due to model bias from domain gaps and data bias from the uneven distribution of artistic styles. Previous techniques like data distillation, traditional data augmentation and style transf...

Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation (2023)
Conference Proceeding
Li, L., Shum, H. P., & Breckon, T. P. (2023). Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/CVPR52729.2023.00903

Whilst the availability of 3D LiDAR point cloud data has significantly grown in recent years, annotation remains expensive and time-consuming, leading to a demand for semi-supervised semantic segmentation methods with application domains such as auton...

Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance Imagery (2023)
Conference Proceeding
Gaus, Y., Bhowmik, N., Isaac-Medina, B., Atapour-Abarghouei, A., Shum, H., & Breckon, T. (2023). Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance Imagery. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). https://doi.org/10.1109/CVPRW59228.2023.00301

Anomaly detection is a classical problem within automated visual surveillance, namely the determination of the normal from the abnormal when operational data availability is highly biased towards one class (normal) due to both insufficient sample siz...

Hierarchical Graph Convolutional Networks for Action Quality Assessment (2023)
Journal Article
Zhou, K., Ma, Y., Shum, H. P., & Liang, X. (2023). Hierarchical Graph Convolutional Networks for Action Quality Assessment. IEEE Transactions on Circuits and Systems for Video Technology, https://doi.org/10.1109/TCSVT.2023.3281413

Action quality assessment (AQA) automatically evaluates how well humans perform actions in a given video, a technique widely used in fields such as rehabilitation medicine, athletic competitions, and specific skills assessment. However, existing work...

INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network (2023)
Journal Article
Chen, S., Atapour-Abarghouei, A., Ho, E. S., & Shum, H. P. (2023). INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network. Software Impacts, 17, Article 100517. https://doi.org/10.1016/j.simpa.2023.100517

We present software that predicts non-cleft facial images for patients with cleft lip, thereby facilitating the understanding, awareness and discussion of cleft lip surgeries. To protect patients’ privacy, we design a software framework using image...

Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition (2023)
Journal Article
Men, Q., Ho, E. S., Shum, H. P., & Leung, H. (2023). Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition. Neurocomputing, 537, 198-209. https://doi.org/10.1016/j.neucom.2023.03.070

Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations....

A Video-Based Augmented Reality System for Human-in-the-Loop Muscle Strength Assessment of Juvenile Dermatomyositis (2023)
Journal Article
Zhou, K., Cai, R., Ma, Y., Tan, Q., Wang, X., Li, J., …Liang, X. (2023). A Video-Based Augmented Reality System for Human-in-the-Loop Muscle Strength Assessment of Juvenile Dermatomyositis. IEEE Transactions on Visualization and Computer Graphics, 29(5), 2456-2466. https://doi.org/10.1109/tvcg.2023.3247092

As the most common idiopathic inflammatory myopathy in children, juvenile dermatomyositis (JDM) is characterized by skin rashes and muscle weakness. The childhood myositis assessment scale (CMAS) is commonly used to measure the degree of muscle invol...

UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-Identification in Video Imagery (2022)
Conference Proceeding
Organisciak, D., Poyser, M., Alsehaim, A., Hu, S., Isaac-Medina, B. K., Breckon, T. P., & Shum, H. P. (2022). UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-Identification in Video Imagery. https://doi.org/10.5220/0010836600003124

As unmanned aerial vehicles (UAV) become more accessible with a growing range of applications, the risk of UAV disruption increases. Recent development in deep learning allows vision-based counter-UAV systems to detect and track UAVs with a single ca...

Denoising Diffusion Probabilistic Models for Styled Walking Synthesis (2022)
Conference Proceeding
Findlay, E., Zhang, H., Chang, Z., & Shum, H. P. (2022). Denoising Diffusion Probabilistic Models for Styled Walking Synthesis. https://doi.org/10.1145/3561975

Generating realistic motions for digital humans is time-consuming for many graphics applications. Data-driven motion synthesis approaches have seen solid progress in recent years through deep generative models. These results offer high-quality motion...

3D Reconstruction of Sculptures from Single Images via Unsupervised Domain Adaptation on Implicit Models (2022)
Conference Proceeding
Chang, Z., Koulieris, G. A., & Shum, H. P. (2022). 3D Reconstruction of Sculptures from Single Images via Unsupervised Domain Adaptation on Implicit Models. https://doi.org/10.1145/3562939.3565632

A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection (2022)
Conference Proceeding
Zhu, M., Ho, E. S., & Shum, H. P. (2022). A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection. https://doi.org/10.1109/smc53654.2022.9945149

Detecting human-object interactions is essential for comprehensive understanding of visual scenes. In particular, spatial connections between humans and objects are important cues for reasoning interactions. To this end, we propose a skeleton-aware g...

A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip (2022)
Conference Proceeding
Chen, S., Atapour-Abarghouei, A., Kerby, J., Ho, E. S., Sainsbury, D. C., Butterworth, S., & Shum, H. P. (2022). A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip. https://doi.org/10.1109/bhi56158.2022.9926917

A cleft lip is a congenital abnormality requiring surgical repair by a specialist. The surgeon must have extensive experience and theoretical knowledge to perform surgery, and an Artificial Intelligence (AI) method has been proposed to guide surgeons in...

Towards Graph Representation Learning Based Surgical Workflow Anticipation (2022)
Conference Proceeding
Zhang, X., Al Moubayed, N., & Shum, H. P. (2022). Towards Graph Representation Learning Based Surgical Workflow Anticipation. https://doi.org/10.1109/bhi56158.2022.9926801

Surgical workflow anticipation can give predictions on what steps to conduct or what instruments to use next, which is an essential part of the computer-assisted intervention system for surgery, e.g. workflow reasoning in robotic surgery. However, cu...

Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos (2022)
Conference Proceeding
Qiao, T., Men, Q., Li, F. W., Kubotani, Y., Morishima, S., & Shum, H. P. (2022). Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos. https://doi.org/10.1007/978-3-031-19772-7_28

Human-Object Interaction (HOI) recognition in videos is important for analysing human activity. Most existing work focusing on visual features usually suffers from occlusion in real-world scenarios. Such a problem will be further complicated when...

Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding (2022)
Conference Proceeding
Li, R., Katsigiannis, S., & Shum, H. P. (2022). Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding. In 2022 IEEE International Conference on Image Processing (ICIP) Proceedings (2346-2350). https://doi.org/10.1109/icip46576.2022.9897644

Trajectory prediction of road users in real-world scenarios is challenging because their movement patterns are stochastic and complex. Previous pedestrian-oriented works have been successful in modelling the complex interactions among pedestrians, bu...

A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction (2022)
Journal Article
Zhu, M., Men, Q., Ho, E. S., Leung, H., & Shum, H. P. (2022). A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction. Journal of Medical Systems, 46(11), Article 76. https://doi.org/10.1007/s10916-022-01857-5

Musculoskeletal and neurological disorders are the most common causes of walking problems among older people, and they often lead to diminished quality of life. Analyzing walking motion data manually requires trained professionals and the evaluations...

CP-AGCN: Pytorch-based Attention Informed Graph Convolutional Network for Identifying Infants at Risk of Cerebral Palsy (2022)
Journal Article
Zhang, H., Ho, E. S., & Shum, H. P. (2022). CP-AGCN: Pytorch-based Attention Informed Graph Convolutional Network for Identifying Infants at Risk of Cerebral Palsy. Software Impacts, 14, Article 100419. https://doi.org/10.1016/j.simpa.2022.100419

Early prediction is clinically considered one of the essential parts of cerebral palsy (CP) treatment. We propose to implement a low-cost and interpretable classification system for supporting CP prediction based on General Movement Assessment (GMA)....

Pose-based Tremor Classification for Parkinson’s Disease Diagnosis from Video (2022)
Conference Proceeding
Zhang, X., Zhang, H., & Shum, H. P. (2022). Pose-based Tremor Classification for Parkinson’s Disease Diagnosis from Video. https://doi.org/10.1007/978-3-031-16440-8_47

Parkinson’s disease (PD) is a progressive neurodegenerative disorder that results in a variety of motor dysfunction symptoms, including tremors, bradykinesia, rigidity and postural instability. The diagnosis of PD mainly relies on clinical experience...

MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray (2022)
Conference Proceeding
Corona-Figueroa, A., Frawley, J., Bond-Taylor, S., Bethapudi, S., Shum, H. P., & Willcocks, C. G. (2022). MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray. https://doi.org/10.1109/embc48229.2022.9871757

Computed tomography (CT) is an effective medical imaging modality, widely used in the field of clinical medicine for the diagnosis of various pathologies. Advances in Multidetector CT imaging technology have enabled additional functionalities, inclu...