G.T. Flitton
Object Classification in 3D Baggage Security Computed Tomography Imagery using Visual Codebooks
Flitton, G.T.; Mouton, A.; Breckon, T.P.
Abstract
We investigate the performance of a Bag of (Visual) Words (BoW) object classification model as an approach for automated threat object detection within 3D Computed Tomography (CT) imagery from a baggage security context. This poses a novel and unique challenge for rigid object classification within complex and cluttered volumetric imagery. Within this context it extends the BoW model to 3D transmission imagery (X-ray CT) from its conventional application in 2D reflectance (photographic) imagery. We explore combinations of four 3D feature descriptors (Density Histogram (DH), Density Gradient Histogram (DGH), Scale Invariant Feature Transform (SIFT) and Rotation Invariant Feature Transform (RIFT)), three codebook assignment methodologies (hard, kernel and uncertainty) and seven codebook sizes. Optimal performance is achieved using the DH and DGH descriptors in conjunction with an uncertainty assignment methodology. Successful detection rates in excess of 97% for handguns and 89% for bottles and false-positive rates of approximately 2–3% are achieved. We demonstrate that the underlying imaging modality and the irrelevance of illumination and scale invariance within the transmission imagery context considered here result in the favourable performance of simpler density histogram descriptors (DH, DGH) over 3D extensions of the well-established SIFT and RIFT feature descriptor approaches.
Citation
Flitton, G., Mouton, A., & Breckon, T. (2015). Object Classification in 3D Baggage Security Computed Tomography Imagery using Visual Codebooks. Pattern Recognition, 48(8), 2489-2499. https://doi.org/10.1016/j.patcog.2015.02.006
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 7, 2015 |
Online Publication Date | Feb 14, 2015 |
Publication Date | Aug 1, 2015 |
Deposit Date | Oct 4, 2015 |
Publicly Available Date | Oct 5, 2015 |
Journal | Pattern Recognition |
Print ISSN | 0031-3203 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 48 |
Issue | 8 |
Pages | 2489-2499 |
DOI | https://doi.org/10.1016/j.patcog.2015.02.006 |
Keywords | 3D Object classification, Bag of (Visual) words, 3D descriptors, SIFT, RIFT, Baggage-CT. |
Public URL | https://durham-repository.worktribe.com/output/1421582 |
Related Public URLs | http://community.dur.ac.uk/toby.breckon/publications/papers/flitton15codebooks.pdf |
Files
Accepted Journal Article
(1.3 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
© 2015 This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
You might also like
Racial Bias within Face Recognition: A Survey
(2024)
Journal Article
Disentangling Racial Phenotypes: Fine-Grained Control of Race-related Facial Phenotype Characteristics
(2024)
Preprint / Working Paper
Progressively Select and Reject Pseudo-labelled Samples for Open-Set Domain Adaptation
(2024)
Journal Article
Generalized Zero-Shot Domain Adaptation via Coupled Conditional Variational Autoencoders
(2023)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search