Tanqiu Qiao tanqiu.qiao@durham.ac.uk
PGR Student Doctor of Philosophy
From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos
Qiao, Tanqiu; Li, Ruochen; Li, Frederick W B; Shum, Hubert P H
Authors
Ruochen Li ruochen.li@durham.ac.uk
PGR Student Doctor of Philosophy
Dr Frederick Li frederick.li@durham.ac.uk
Associate Professor
Professor Hubert Shum hubert.shum@durham.ac.uk
Professor
Abstract
Video-based Human-Object Interaction (HOI) recognition explores the intricate dynamics between humans and objects, which are essential for a comprehensive understanding of human behavior and intentions. While previous work has made significant strides, effectively integrating geometric and visual features to model dynamic relationships between humans and objects in a graph framework remains a challenge. In this work, we propose a novel end-to-end category to scenery framework, CATS, starting by generating geometric features for various categories through graphs respectively, then fusing them with corresponding visual features. Subsequently, we construct a scenery interactive graph with these enhanced geometric-visual features as nodes to learn the relationships among human and object categories. This methodological advance facilitates a deeper, more structured comprehension of interactions, bridging category-specific insights with broad scenery dynamics. Our method demonstrates state-of-the-art performance on two pivotal HOI benchmarks , including the MPHOI-72 dataset for multi-person HOIs and the single-person HOI CAD-120 dataset.
Citation
Qiao, T., Li, R., Li, F. W. B., & Shum, H. P. H. (2024, December). From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos. Presented at ICPR 2024: International Conference on Pattern Recognition, Kolkata, India
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | ICPR 2024: International Conference on Pattern Recognition |
Start Date | Dec 1, 2024 |
End Date | Dec 5, 2024 |
Acceptance Date | Jul 1, 2024 |
Online Publication Date | Dec 4, 2024 |
Publication Date | Dec 4, 2024 |
Deposit Date | Jul 2, 2024 |
Publicly Available Date | Dec 11, 2024 |
Print ISSN | 0302-9743 |
Publisher | Springer |
Peer Reviewed | Peer Reviewed |
Pages | 262-277 |
Series Title | Lecture Notes in Computer Science |
Series Number | 15315 |
Series ISSN | 0302-9743 |
Book Title | Pattern Recognition 27th International Conference, ICPR 2024, Kolkata, India, December 1–5, 2024, Proceedings, Part XV |
ISBN | 9783031783531 |
DOI | https://doi.org/10.1007/978-3-031-78354-8_17 |
Public URL | https://durham-repository.worktribe.com/output/2514413 |
Files
Accepted Journal Article
(1.1 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
This accepted manuscript is licensed under the Creative Commons Attribution 4.0 licence. https://creativecommons.org/licenses/by/4.0/
You might also like
Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos
(2022)
Presentation / Conference Contribution
TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training
(2024)
Presentation / Conference Contribution
Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding
(2022)
Presentation / Conference Contribution
Multi-Feature Fusion Enhanced Monocular Depth Estimation With Boundary Awareness
(2024)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search