Professor Hubert Shum

TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training (2024)
Presentation / Conference Contribution
Li, L., Qiao, T., Shum, H. P. H., & Breckon, T. P. (2024, November). TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training. Presented at BMVC'24: The 35th British Machine Vision Conference, Glasgow, UK

MxT: Mamba x Transformer for Image Inpainting (2024)
Presentation / Conference Contribution
Chen, S., Atapour-Abarghouei, A., Zhang, H., & Shum, H. P. H. (2024, November). MxT: Mamba x Transformer for Image Inpainting. Presented at BMVC 2024: The 35th British Machine Vision Conference, Glasgow, UK

Image inpainting, or image completion, is a crucial task in computer vision that aims to restore missing or damaged regions of images with semantically coherent content. This technique requires a precise balance of local texture replication and globa...

Artificial intelligence for geometry-based feature extraction, analysis and synthesis in artistic images: a survey (2024)
Journal Article
Vijendran, M., Deng, J., Chen, S., Ho, E. S. L., & Shum, H. P. H. (2025). Artificial intelligence for geometry-based feature extraction, analysis and synthesis in artistic images: a survey. Artificial Intelligence Review, 58(2), Article 64. https://doi.org/10.1007/s10462-024-11051-3

Artificial Intelligence significantly enhances the visual art industry by analyzing, identifying and generating digitized artistic images. This review highlights the substantial benefits of integrating geometric data into AI models, addressing challe...

Adaptive Graph Learning from Spatial Information for Surgical Workflow Anticipation (2024)
Journal Article
Zhang, F. X., Deng, J., Lieck, R., & Shum, H. P. H. (online). Adaptive Graph Learning from Spatial Information for Surgical Workflow Anticipation. IEEE Transactions on Medical Robotics and Bionics, https://doi.org/10.1109/TMRB.2024.3517137

Neural-code PIFu: High-fidelity Single Image 3D Human Reconstruction via Neural Code Integration (2024)
Presentation / Conference Contribution
Liu, R., Remagnino, P., & Shum, H. P. H. (2024, December). Neural-code PIFu: High-fidelity Single Image 3D Human Reconstruction via Neural Code Integration. Presented at 2024 International Conference on Pattern Recognition, Kolkata, India

We introduce neural-code PIFu, a novel implicit function for 3D human reconstruction. Leveraging neural codebooks, our approach learns recurrent patterns in the feature space and reuses them to improve current features. Many existing methods predict...

From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos (2024)
Presentation / Conference Contribution
Qiao, T., Li, R., Li, F. W. B., & Shum, H. P. H. (2024, December). From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos. Presented at ICPR 2024: International Conference on Pattern Recognition, Kolkata, India

Video-based Human-Object Interaction (HOI) recognition explores the intricate dynamics between humans and objects, which are essential for a comprehensive understanding of human behavior and intentions. While previous work has made significant stride...

MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment (2024)
Presentation / Conference Contribution
Zhou, K., Wang, L., Zhang, X., Shum, H. P. H., Li, F. W. B., Li, J., & Liang, X. (2024, September). MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment. Presented at ECCV 2024: The 18th European Conference on Computer Vision, Milan, Italy

Action Quality Assessment (AQA) evaluates diverse skills but models struggle with non-stationary data. We propose Continual AQA (CAQA) to refine models using sparse new data. Feature replay preserves memory without storing raw inputs. However, the mi...

SEM-Net: Efficient Pixel Modelling for Image Inpainting with Spatially Enhanced SSM (2024)
Presentation / Conference Contribution
Chen, S., Zhang, H., Atapour-Abarghouei, A., & Shum, H. P. H. (2025, February). SEM-Net: Efficient Pixel Modelling for Image Inpainting with Spatially Enhanced SSM. Presented at 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, Arizona

Unraveling the brain dynamics of Depersonalization-Derealization Disorder: a dynamic functional network connectivity analysis (2024)
Journal Article
Zheng, S., Zhang, F. X., Shum, H. P. H., Zhang, H., Song, N., Song, M., & Jia, H. (2024). Unraveling the brain dynamics of Depersonalization-Derealization Disorder: a dynamic functional network connectivity analysis. BMC Psychiatry, 24, Article 685. https://doi.org/10.1186/s12888-024-06096-1

Background: Depersonalization-Derealization Disorder (DPD), a prevalent psychiatric disorder, fundamentally disrupts self-consciousness and could significantly impact the quality of life of those affected. While existing research has provided foundat...

Chatbots and Art Critique: A Comparative Study of Chatbot and Human Experts in Traditional Chinese Painting Education (2024)
Presentation / Conference Contribution
Liu, J., Law, E. L.-C., & Shum, H. P. H. (2024, October). Chatbots and Art Critique: A Comparative Study of Chatbot and Human Experts in Traditional Chinese Painting Education. Presented at NordiCHI 2024: Nordic Conference on Human-Computer Interaction, Uppsala, Sweden

Driven by the recent incorporation of chatbots into art education, art critique as a key factor in this realm poses distinct challenges and opportunities for this technology intervention. This study investigates the efficacy of chatbot-generated crit...

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation (2024)
Presentation / Conference Contribution
Li, L., Shum, H. P. H., & Breckon, T. P. (2024, September). RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation. Presented at ECCV 2024: European Conference on Computer Vision, Milan, Italy

3D point clouds play a pivotal role in outdoor scene perception, especially in the context of autonomous driving. Recent advancements in 3D LiDAR segmentation often focus intensely on the spatial positioning and distribution of points for accurate se...

Two-Person Interaction Augmentation with Skeleton Priors (2024)
Presentation / Conference Contribution
Li, B., Ho, E. S. L., Shum, H. P. H., & Wang, H. (2024, June). Two-Person Interaction Augmentation with Skeleton Priors. Presented at 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, Washington

Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skelet...

Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling (2024)
Presentation / Conference Contribution
Corona-Figueroa, A., Shum, H. P. H., & Willcocks, C. G. (2024, June). Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling. Presented at 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, Washington

Advancing healthcare practice and education via data sharing: demonstrating the utility of open data by training an artificial intelligence model to assess cardiopulmonary resuscitation skills (2024)
Journal Article
Constable, M. D., Zhang, F. X., Conner, T., Monk, D., Rajsic, J., Ford, C., Park, L. J., Platt, A., Porteous, D., Grierson, L., & Shum, H. P. H. (online). Advancing healthcare practice and education via data sharing: demonstrating the utility of open data by training an artificial intelligence model to assess cardiopulmonary resuscitation skills. Advances in Health Sciences Education, https://doi.org/10.1007/s10459-024-10369-5

Health professional education stands to gain substantially from collective efforts toward building video databases of skill performances in both real and simulated settings. An accessible resource of videos that demonstrate an array of performances –...

One-Index Vector Quantization Based Adversarial Attack on Image Classification (2024)
Journal Article
Fan, H., Qin, X., Chen, S., Shum, H. P. H., & Li, M. (2024). One-Index Vector Quantization Based Adversarial Attack on Image Classification. Pattern Recognition Letters, 186, 47-56. https://doi.org/10.1016/j.patrec.2024.09.001

To improve storage and transmission, images are generally compressed. Vector quantization (VQ) is a popular compression method as it has a high compression ratio that surpasses other compression techniques. Despite this, existing adversarial attack...

ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification (2024)
Book Chapter
Vijendran, M., Li, F. W. B., Deng, J., & Shum, H. P. H. (in press). ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification. In CCIS '24: Communications in Computer and Information Science. Springer

Geometric Features Enhanced Human-Object Interaction Detection (2024)
Journal Article
Zhu, M., Ho, E. S. L., Chen, S., Yang, L., & Shum, H. P. H. (2024). Geometric Features Enhanced Human-Object Interaction Detection. IEEE Transactions on Instrumentation and Measurement, 73, Article 5026014. https://doi.org/10.1109/TIM.2024.3427800

Cameras are essential vision instruments to capture images for pattern detection and measurement. Human–object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Tra...

Depth-Aware Endoscopic Video Inpainting (2024)
Presentation / Conference Contribution
Zhang, F. X., Chen, S., Xie, X., & Shum, H. P. H. (2024, October). Depth-Aware Endoscopic Video Inpainting. Presented at 27th International Conference on Medical Image Computing and Computer Assisted Intervention, Marrakesh, Morocco

Self-Regulated Sample Diversity in Large Language Models (2024)
Presentation / Conference Contribution
Liu, M., Frawley, J., Wyer, S., Shum, H. P. H., Uckelman, S. L., Black, S., & Willcocks, C. G. (2024, June). Self-Regulated Sample Diversity in Large Language Models. Presented at NAACL 2024: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Mexico City

Professor Hubert Shum's Outputs (25)