Tanqiu Qiao tanqiu.qiao@durham.ac.uk
PGR Student Doctor of Philosophy
Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos
Qiao, Tanqiu; Men, Qianhui; Li, Frederick W.B.; Kubotani, Yoshiki; Morishima, Shigeo; Shum, Hubert P.H.
Authors
Qianhui Men
Dr Frederick Li frederick.li@durham.ac.uk
Associate Professor
Yoshiki Kubotani
Shigeo Morishima
Professor Hubert Shum hubert.shum@durham.ac.uk
Professor
Abstract
Human-Object Interaction (HOI) recognition in videos is important for analysing human activity. Most existing work focusing on visual features usually suffer from occlusion in the real-world scenarios. Such a problem will be further complicated when multiple people and objects are involved in HOIs. Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN). The geometric-level graph models the interdependency between geometric features of humans and objects, while the fusion-level graph further fuses them with visual features of humans and objects. To demonstrate the novelty and effectiveness of our method in challenging scenarios, we propose a new multi-person HOI dataset (MPHOI-72). Extensive experiments on MPHOI-72 (multi-person HOI), CAD-120 (single-human HOI) and Bimanual Actions (two-hand HOI) datasets demonstrate our superior performance compared to state-of-the-arts.
Citation
Qiao, T., Men, Q., Li, F. W., Kubotani, Y., Morishima, S., & Shum, H. P. (2022, October). Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos. Presented at Computer Vision - ECCV 2022, Tel Aviv, Israel
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | Computer Vision - ECCV 2022 |
Start Date | Oct 23, 2022 |
End Date | Oct 27, 2022 |
Acceptance Date | Jul 8, 2022 |
Online Publication Date | Oct 28, 2022 |
Publication Date | 2022 |
Deposit Date | Jul 19, 2022 |
Publicly Available Date | Oct 29, 2023 |
Print ISSN | 0302-9743 |
Publisher | Springer Verlag |
Pages | 474-491 |
Series Title | Lecture Notes in Computer Science |
Series ISSN | 0302-9743 |
DOI | https://doi.org/10.1007/978-3-031-19772-7_28 |
Public URL | https://durham-repository.worktribe.com/output/1136540 |
Files
Accepted Conference Proceeding
(1.7 Mb)
PDF
Copyright Statement
The final authenticated version is available online at https://doi.org/https://doi.org/10.1007/978-3-031-19772-7_28
You might also like
From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos
(2024)
Presentation / Conference Contribution
TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training
(2024)
Presentation / Conference Contribution
Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction
(2025)
Journal Article
A Differential Diffusion Theory for Participating Media
(2023)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search