Skip to main content

Research Repository

Advanced Search

Outputs (16)

Bi-projection-based Foreground-aware Omnidirectional Depth Prediction (2021)
Conference Proceeding
Feng, Q., Shum, H. P., & Morishima, S. (2021). Bi-projection-based Foreground-aware Omnidirectional Depth Prediction.

Due to the increasing availability of commercial 360- degree cameras, accurate depth prediction for omnidirectional images can be beneficial to a wide range of applications including video editing and augmented reality. Regarding existing methods, so... Read More about Bi-projection-based Foreground-aware Omnidirectional Depth Prediction.

DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications (2021)
Conference Proceeding
Li, L., Ismail, K. N., Shum, H. P., & Breckon, T. P. (2021). DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications. . https://doi.org/10.1109/3dv53792.2021.00130

We present DurLAR, a high-fidelity 128-channel 3D LiDAR dataset with panoramic ambient (near infrared) and reflectivity imagery, as well as a sample benchmark task using depth estimation for autonomous driving applications. Our driving platform is eq... Read More about DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications.

Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction (2021)
Conference Proceeding
Rainbow, B. A., Men, Q., & Shum, H. P. (2021). Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction. . https://doi.org/10.1109/smc52423.2021.9658781

Predicting the movement trajectories of multiple classes of road users in real-world scenarios is a challenging task due to the diverse trajectory patterns. While recent works of pedestrian trajectory prediction successfully modelled the influence of... Read More about Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction.

A Pose-based Feature Fusion and Classification Framework for the Early Prediction of Cerebral Palsy in Infants (2021)
Journal Article
McCay, K. D., Hu, P., Shum, H. P., Woo, W. L., Marcroft, C., Embleton, N. D., …Ho, E. S. (2022). A Pose-based Feature Fusion and Classification Framework for the Early Prediction of Cerebral Palsy in Infants. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 30, 8-19. https://doi.org/10.1109/tnsre.2021.3138185

The early diagnosis of cerebral palsy is an area which has recently seen significant multi-disciplinary research. Diagnostic tools such as the General Movements Assessment (GMA), have produced some very promising results. However, the prospect of aut... Read More about A Pose-based Feature Fusion and Classification Framework for the Early Prediction of Cerebral Palsy in Infants.

PyTorch-based Implementation of Label-aware Graph Representation for Multi-class Trajectory Prediction (2021)
Journal Article
Men, Q., & Shum, H. P. (2022). PyTorch-based Implementation of Label-aware Graph Representation for Multi-class Trajectory Prediction. Software impacts, 11, Article 100201. https://doi.org/10.1016/j.simpa.2021.100201

Trajectory Prediction under diverse patterns has attracted increasing attention in multiple real-world applications ranging from urban traffic analysis to human motion understanding, among which graph convolution network (GCN) is frequently adopted w... Read More about PyTorch-based Implementation of Label-aware Graph Representation for Multi-class Trajectory Prediction.

Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark (2021)
Conference Proceeding
Isaac-Medina, B. K., Poyser, M., Organisciak, D., Willcocks, C. G., Breckon, T. P., & Shum, H. P. (2021). Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark. . https://doi.org/10.1109/iccvw54120.2021.00142

Unmanned Aerial Vehicles (UAV) can pose a major risk for aviation safety, due to both negligent and malicious use. For this reason, the automated detection and tracking of UAV is a fundamental task in aerial security systems. Common technologies for... Read More about Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark.

STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising (2021)
Conference Proceeding
Zhou, K., Cheng, Z., Shum, H. P., Li, F. W., & Liang, X. (2021). STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising. . https://doi.org/10.1109/ismar52148.2021.00018

Hand object interaction in mixed reality (MR) relies on the accurate tracking and estimation of human hands, which provide users with a sense of immersion. However, raw captured hand motion data always contains errors such as joints occlusion, disloc... Read More about STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising.

Human-centric Autonomous Driving in an AV-Pedestrian Interactive Environment Using SVO (2021)
Conference Proceeding
Crosato, L., Wei, C., Ho, E. S., & Shum, H. P. (2021). Human-centric Autonomous Driving in an AV-Pedestrian Interactive Environment Using SVO. . https://doi.org/10.1109/ichms53169.2021.9582640

As Autonomous Vehicles (AV) are becoming a reality, the design of efficient motion control algorithms will have to deal with the unpredictable and interactive nature of other road users. Current AV motion planning algorithms suffer from the freezing... Read More about Human-centric Autonomous Driving in an AV-Pedestrian Interactive Environment Using SVO.

GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction (2021)
Journal Article
Men, Q., Shum, H. P., Ho, E. S., & Leung, H. (2022). GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction. Computers and Graphics, 102, 634-645. https://doi.org/10.1016/j.cag.2021.09.014

Creating realistic characters that can react to the users’ or another character’s movement can benefit computer graphics, games and virtual reality hugely. However, synthesizing such reactive motions in human-human interactions is a challenging task... Read More about GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction.

Interpreting Deep Learning based Cerebral Palsy Prediction with Channel Attention (2021)
Conference Proceeding
Zhu, M., Men, Q., Ho, E. S., Leung, H., & Shum, H. P. (2021). Interpreting Deep Learning based Cerebral Palsy Prediction with Channel Attention. . https://doi.org/10.1109/bhi50953.2021.9508619

Early prediction of cerebral palsy is essential as it leads to early treatment and monitoring. Deep learning has shown promising results in biomedical engineering thanks to its capacity of modelling complicated data with its non-linear architecture.... Read More about Interpreting Deep Learning based Cerebral Palsy Prediction with Channel Attention.