STIT: Spatio-Temporal Interaction Transformers for Human-Object Interaction Recognition in Videos
(2022)
Presentation / Conference Contribution
Almushyti, M., & Li, F. W. (2022). STIT: Spatio-Temporal Interaction Transformers for Human-Object Interaction Recognition in Videos. . https://doi.org/10.1109/icpr56361.2022.9956030
Recognizing human-object interactions is challenging due to their spatio-temporal changes. We propose the SpatioTemporal Interaction Transformer-based (STIT) network to reason such changes. Specifically, spatial transformers learn humans and objects... Read More about STIT: Spatio-Temporal Interaction Transformers for Human-Object Interaction Recognition in Videos.