Qian Wang qian.wang@durham.ac.uk
Academic Visitor
Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery
Wang, Q.; Bhowmik, N.; Breckon, T.P.
Authors
Dr Neelanjan Bhowmik neelanjan.bhowmik@durham.ac.uk
Post Doctoral Research Associate
Professor Toby Breckon toby.breckon@durham.ac.uk
Professor
Abstract
Automatic detection of prohibited objects within passenger baggage is important for aviation security. X-ray Computed Tomography (CT) based 3D imaging is widely used in airports for aviation security screening whilst prior work on automatic prohibited item detection focus primarily on 2D X-ray imagery. Whilst some prior work has proven the possibility of extending deep convolutional neural networks (CNN) based automatic prohibited item detection from 2D X-ray imagery to volumetric 3D CT baggage security screening imagery, it focuses on the detection of one specific type of objects (e.g., either bottles or handguns). As a result, multiple models are needed if more than one type of prohibited item is required to be detected in practice. In this paper, we consider the detection of multiple object categories of interest using one unified framework. To this end, we formulate a more challenging multi-class 3D object detection problem within 3D CT imagery and propose a viable solution (3D RetinaNet) to tackle this problem. To enhance the performance of detection we investigate a variety of strategies including data augmentation and varying backbone networks. Experimentation carried out to provide both quantitative and qualitative evaluations of the proposed approach to multi-class 3D object detection within 3D CT baggage security screening imagery. Experimental results demonstrate the combination of the 3D RetinaNet and a series of favorable strategies can achieve a mean Average Precision (mAP) of 65.3% over five object classes (i.e. bottles, handguns, binoculars, glock frames, iPods). The overall performance is affected by the poor performance on glock frames and iPods due to the lack of data and their resemblance with the baggage clutter.
Citation
Wang, Q., Bhowmik, N., & Breckon, T. (2020, December). Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery. Presented at 19th IEEE International Conference on Machine Learning and Applications (ICMLA 2020), Miami, Florida
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 19th IEEE International Conference on Machine Learning and Applications (ICMLA 2020) |
Start Date | Dec 14, 2020 |
End Date | Dec 17, 2020 |
Acceptance Date | Sep 16, 2020 |
Online Publication Date | Feb 23, 2021 |
Publication Date | 2021 |
Deposit Date | Oct 26, 2020 |
Publicly Available Date | Oct 27, 2020 |
Publisher | Institute of Electrical and Electronics Engineers |
DOI | https://doi.org/10.1109/icmla51294.2020.00012 |
Public URL | https://durham-repository.worktribe.com/output/1140077 |
Files
Accepted Conference Proceeding
(1.4 Mb)
PDF
Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
You might also like
Generalized Zero-Shot Domain Adaptation via Coupled Conditional Variational Autoencoders
(2023)
Journal Article
Cross-Domain Structure Preserving Projection for Heterogeneous Domain Adaptation
(2021)
Journal Article
DP2-NILM: A distributed and privacy-preserving framework for non-intrusive load monitoring
(2023)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search