B.K.S. Isaac-Medina
Multi-view Object Detection Using Epipolar Constraints within Cluttered X-ray Security Imagery
Isaac-Medina, B.K.S.; Willcocks, C.G.; Breckon, T.P.
Authors
Dr Chris Willcocks christopher.g.willcocks@durham.ac.uk
Associate Professor
Professor Toby Breckon toby.breckon@durham.ac.uk
Professor
Abstract
Automatic detection for threat object items is an increasing emerging area of future application in X-ray security imagery. Although modern X-ray security scanners can provide two or more views, the integration of such object detectors across the views has not been widely explored with rigour. Therefore, we investigate the application of geometric constraints using the epipolar nature of multi-view imagery to improve object detection performance. Furthermore, we assume that images come from uncalibrated views, such that a method to estimate the fundamental matrix using ground truth bounding box centroids from multiple view object labels is proposed. In addition, detections are given a confidence probability based on its similarity with respect to the distribution of the distance to the epipolar line. This probability is used as confidence weights for merging duplicated predictions using non-maximum suppression. Using a standard object detector (YOLOv3), our technique increases the average precision of detection by 2.8% on a dataset composed of firearms, laptops, knives and cameras. These results indicate that the integration of images at different views significantly improves the detection performance of threat items of cluttered X-ray security images.
Presentation Conference Type | Conference Paper (Published) |
---|---|
Conference Name | 25th International Conference on Pattern Recognition (ICPR 2020) |
Start Date | Jan 10, 2021 |
End Date | Jan 15, 2021 |
Acceptance Date | Oct 11, 2020 |
Online Publication Date | May 5, 2021 |
Publication Date | 2021 |
Deposit Date | Oct 25, 2020 |
Publicly Available Date | Oct 27, 2020 |
Series ISSN | 1051-4651 |
DOI | https://doi.org/10.1109/icpr48806.2021.9413007 |
Public URL | https://durham-repository.worktribe.com/output/1141503 |
Files
Accepted Conference Proceeding
(4 Mb)
PDF
Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
You might also like
Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling
(2024)
Conference Proceeding
Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers
(2023)
Conference Proceeding
∞-Diff: Infinite Resolution Diffusion with Subsampled Mollified States
(2024)
Conference Proceeding
Self-Regulated Sample Diversity in Large Language Models
(2024)
Conference Proceeding
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search