Skip to main content

Research Repository

Advanced Search

Outputs (211)

Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes (2022)
Presentation / Conference Contribution
Bond-Taylor, S., Hessey, P., Sasaki, H., Breckon, T., & Willcocks, C. (2022, October). Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes. Presented at ECCV 2022: European Conference on Computer Vision, Tel Aviv, Israel

Whilst diffusion probabilistic models can generate high quality image content, key limitations remain in terms of both generating high-resolution imagery and their associated high computational requirements. Recent Vector-Quantized image models have... Read More about Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes.

On Depth Error from Spherical Camera Calibration within Omnidirectional Stereo Vision (2022)
Presentation / Conference Contribution
Groom, M., & Breckon, T. (2022, August). On Depth Error from Spherical Camera Calibration within Omnidirectional Stereo Vision. Presented at 26th International Conference on Pattern Recognition, Montreal, Québec

As a depth sensing approach, whilst stereo vision provides a good compromise between accuracy and cost, a key limitation is the limited field of view of the conventional cameras that are used within most stereo configurations. By contrast, the use of... Read More about On Depth Error from Spherical Camera Calibration within Omnidirectional Stereo Vision.

Multi-view Vision Transformers for Object Detection (2022)
Presentation / Conference Contribution
Isaac-Medina, B., Willcocks, C., & Breckon, T. (2022, August). Multi-view Vision Transformers for Object Detection. Presented at International Conference on Pattern Recognition, Montreal, Canada

Evaluating Gaussian Grasp Maps for Generative Grasping Models (2022)
Presentation / Conference Contribution
Prew, W., Breckon, T., Bordewich, M., & Beierholm, U. (2022, July). Evaluating Gaussian Grasp Maps for Generative Grasping Models. Presented at Proc. Int. Joint Conf. Neural Networks, Padova, Italy

Generalising robotic grasping to previously unseen objects is a key task in general robotic manipulation. The current method for training many antipodal generative grasping models rely on a binary ground truth grasp map generated from the centre thir... Read More about Evaluating Gaussian Grasp Maps for Generative Grasping Models.

Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery (2022)
Presentation / Conference Contribution
Bhowmik, N., Barker, J., Gaus, Y., & Breckon, T. (2022, June). Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery. Presented at 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, Louisiana

Lossy image compression strategies allow for more efficient storage and transmission of data by encoding data to a reduced form. This is essential enable training with larger datasets on less storage-equipped environments. However, such compression c... Read More about Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery.

Cross-modal Image Synthesis in Dual-Energy X-Ray Security Imagery (2022)
Presentation / Conference Contribution
Isaac-Medina, B., Bhowmik, N., Willcocks, C., & Breckon, T. (2022, June). Cross-modal Image Synthesis in Dual-Energy X-Ray Security Imagery. Presented at 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, Louisiana

Dual-energy X-ray scanners are used for aviation security screening given their capability to discriminate materials inside passenger baggage. To facilitate manual operator inspection, a pseudo-colouring is assigned to the effective composition of th... Read More about Cross-modal Image Synthesis in Dual-Energy X-Ray Security Imagery.

Measuring Hidden Bias within Face Recognition via Racial Phenotypes (2022)
Presentation / Conference Contribution
Yucer, S., Tekras, F., Al Moubayed, N., & Breckon, T. (2022, January). Measuring Hidden Bias within Face Recognition via Racial Phenotypes. Presented at Proc. Winter Conference on Applications of Computer Vision, Waikoloa, HI

Recent work reports disparate performance for intersectional racial groups across face recognition tasks: face verification and identification. However, the definition of those racial groups has a significant impact on the underlying findings of such... Read More about Measuring Hidden Bias within Face Recognition via Racial Phenotypes.

Crowd Counting via Segmentation Guided Attention Networks and Curriculum Loss (2022)
Journal Article
Wang, Q., & Breckon, T. (2022). Crowd Counting via Segmentation Guided Attention Networks and Curriculum Loss. IEEE Transactions on Intelligent Transportation Systems, 23(9), 15233-15243. https://doi.org/10.1109/tits.2021.3138896

Automatic crowd behaviour analysis is an important task for intelligent transportation systems to enable effective flow control and dynamic route planning for varying road participants. Crowd counting is one of the keys to automatic crowd behaviour a... Read More about Crowd Counting via Segmentation Guided Attention Networks and Curriculum Loss.

Semi-supervised Surface Anomaly Detection of Composite Wind Turbine Blades from Drone Imagery (2022)
Presentation / Conference Contribution
Barker, J., Bhowmik, N., & Breckon, T. (2022, February). Semi-supervised Surface Anomaly Detection of Composite Wind Turbine Blades from Drone Imagery. Presented at Computer Vision Theory and Applications 2022

Within commercial wind energy generation, the monitoring and predictive maintenance of wind turbine blades in-situ is a crucial task, for which remote monitoring via aerial survey from an Unmanned Aerial Vehicle (UAV) is commonplace. Turbine blades a... Read More about Semi-supervised Surface Anomaly Detection of Composite Wind Turbine Blades from Drone Imagery.

DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications (2021)
Presentation / Conference Contribution
Li, L., Ismail, K. N., Shum, H. P., & Breckon, T. P. (2021, December). DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications. Presented at International Conference on 3D Vision, Surrey / Online

We present DurLAR, a high-fidelity 128-channel 3D LiDAR dataset with panoramic ambient (near infrared) and reflectivity imagery, as well as a sample benchmark task using depth estimation for autonomous driving applications. Our driving platform is eq... Read More about DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications.