Aishah Alsehaim aishah.a.alsehaim@durham.ac.uk
PGR Student Doctor of Philosophy
VID-Trans-ReID: Enhanced Video Transformers for Person Re-identification
Alsehaim, A.; Breckon, T.P.
Authors
Professor Toby Breckon toby.breckon@durham.ac.uk
Professor
Abstract
Video-based person Re-identification (Re-ID) has received increasing attention recently due to its important role within surveillance video analysis. Video-based Re- ID expands upon earlier image-based methods by extracting person features temporally across multiple video image frames. The key challenge within person Re-ID is extracting a robust feature representation that is invariant to the challenges of pose and illumination variation across multiple camera viewpoints. Whilst most contemporary methods use a CNN based methodology, recent advances in vision transformer (ViT) architectures boost fine-grained feature discrimination via the use of both multi-head attention without any loss of feature robustness. To specifically enable ViT architectures to effectively address the challenges of video person Re-ID, we propose two novel modules constructs, Temporal Clip Shift and Shuffled (TCSS) and Video Patch Part Feature (VPPF), that boost the robustness of the resultant Re-ID feature representation. Furthermore, we combine our proposed approach with current best practices spanning both image and video based Re-ID including camera view embedding. Our proposed approach outperforms existing state-of-the-art work on the MARS, PRID2011, and iLIDS-VID Re-ID benchmark datasets achieving 96.36%, 96.63%, 94.67% rank-1 accuracy respectively and achieving 90.25% mAP on MARS.
Citation
Alsehaim, A., & Breckon, T. (2022, November). VID-Trans-ReID: Enhanced Video Transformers for Person Re-identification. Presented at BMVC 2022: The 33rd British Machine Vision Conference, London, UK
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | BMVC 2022: The 33rd British Machine Vision Conference |
Start Date | Nov 21, 2022 |
End Date | Nov 24, 2022 |
Acceptance Date | Sep 30, 2022 |
Online Publication Date | Nov 21, 2022 |
Publication Date | 2022-11 |
Deposit Date | Oct 13, 2022 |
Publicly Available Date | Nov 24, 2022 |
Public URL | https://durham-repository.worktribe.com/output/1134965 |
Publisher URL | https://britishmachinevisionassociation.github.io/bmvc |
Files
Published Conference Proceeding
(2 Mb)
PDF
Copyright Statement
© 2022. The copyright of this document resides with its authors.
It may be distributed unchanged freely in print or electronic forms.
You might also like
Progressively Select and Reject Pseudo-labelled Samples for Open-Set Domain Adaptation
(2024)
Journal Article
Generalized Zero-Shot Domain Adaptation via Coupled Conditional Variational Autoencoders
(2023)
Journal Article
Cross-Domain Structure Preserving Projection for Heterogeneous Domain Adaptation
(2021)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search