Vision, Imaging and Visualisation in Durham (VIViD)

A model based approach for vessel caliber measurement in retinal images (2012)
Presentation / Conference Contribution

Is Unimodal Bias Always Bad for Visual Question Answering? A Medical Domain Study with Dynamic Attention (2022)
Presentation / Conference Contribution

Medical visual question answering (Med-VQA) is to answer medical questions based on clinical images provided. This field is still in its infancy due to the complexity of the trio formed of questions, multimodal features and expert knowledge. In this...

Retinal image analysis aimed at extraction of vascular structure using linear discriminant classifier (2013)
Presentation / Conference Contribution

Smart IoT Cameras for Crowd Analysis based on augmentation for automatic pedestrian detection, simulation and annotation (2019)
Presentation / Conference Contribution

A Comparison of Embedded Deep Learning Methods for Person Detection (2019)
Presentation / Conference Contribution

Learning discriminatory deep clustering models (2019)
Presentation / Conference Contribution

Denoising Diffusion Probabilistic Models for Styled Walking Synthesis (2022)
Presentation / Conference Contribution

Generating realistic motions for digital humans is time-consuming for many graphics applications. Data-driven motion synthesis approaches have seen solid progress in recent years through deep generative models. These results offer high-quality motion...

MedZip: 3D medical images lossless compressor using recurrent neural network (LSTM) (2021)
Presentation / Conference Contribution

Accurate Deep Net Crowd Counting for Smart IoT Video acquisition devices (2020)
Presentation / Conference Contribution

A deep convolutional auto-encoder with embedded clustering (2018)
Presentation / Conference Contribution

Recognizing conversational interaction based on 3D human pose (2013)
Presentation / Conference Contribution

From clamped local shape models to global shape model (2013)
Presentation / Conference Contribution

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes (2022)
Presentation / Conference Contribution

Whilst diffusion probabilistic models can generate high quality image content, key limitations remain in terms of both generating high-resolution imagery and their associated high computational requirements. Recent Vector-Quantized image models have...

STIT: Spatio-Temporal Interaction Transformers for Human-Object Interaction Recognition in Videos (2022)
Presentation / Conference Contribution

Recognizing human-object interactions is challenging due to their spatio-temporal changes. We propose the SpatioTemporal Interaction Transformer-based (STIT) network to reason such changes. Specifically, spatial transformers learn humans and objects...

Towards Graph Representation Learning Based Surgical Workflow Anticipation (2022)
Presentation / Conference Contribution

Surgical workflow anticipation can give predictions on what steps to conduct or what instruments to use next, which is an essential part of the computer-assisted intervention system for surgery, e.g. workflow reasoning in robotic surgery. However, cu...

A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip (2022)
Presentation / Conference Contribution

A Cleft lip is a congenital abnormality requiring surgical repair by a specialist. The surgeon must have extensive experience and theoretical knowledge to perform surgery, and Artificial Intelligence (AI) method has been proposed to guide surgeons in...

Detecting Melanoma Fairly: Skin Tone Detection and Debiasing for Skin Lesion Classification (2022)
Presentation / Conference Contribution

Convolutional Neural Networks have demonstrated human-level performance in the classification of melanoma and other skin lesions, but evident performance disparities between differing skin tones should be addressed before widespread deployment. In th...

3D Reconstruction of Sculptures from Single Images via Unsupervised Domain Adaptation on Implicit Models (2022)
Presentation / Conference Contribution

A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection (2022)
Presentation / Conference Contribution

Detecting human-object interactions is essential for comprehensive understanding of visual scenes. In particular, spatial connections between humans and objects are important cues for reasoning interactions. To this end, we propose a skeleton-aware g...

Multi-view Vision Transformers for Object Detection (2022)
Presentation / Conference Contribution

Outputs (430)