From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos

Tanqiu Qiao, Ruochen Li, Frederick W. B. Li and Hubert P. H. Shum
Proceedings of the 2024 International Conference on Pattern Recognition (ICPR), 2024

 H5-Index: 56#

From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos
# According to Google Scholar 2024

Abstract

Video-based Human-Object Interaction (HOI) recognition explores the intricate dynamics between humans and objects, which are essential for a comprehensive understanding of human behavior and intentions. While previous work has made significant strides, effectively integrating geometric and visual features to model dynamic relationships between humans and objects in a graph framework remains a challenge. In this work, we propose a novel end-to-end category to scenery framework, CATS, starting by generating geometric features for various categories through graphs respectively, then fusing them with corresponding visual features. Subsequently, we construct a scenery interactive graph with these enhanced geometric-visual features as nodes to learn the relationships among human and object categories. This methodological advance facilitates a deeper, more structured comprehension of interactions, bridging category-specific insights with broad scenery dynamics. Our method demonstrates state-of-the-art performance on two pivotal HOI benchmarks, including the MPHOI-72 dataset for multi-person HOIs and the single-person HOI CAD-120 dataset.


Downloads


YouTube


Cite This Research

Plain Text

Tanqiu Qiao, Ruochen Li, Frederick W. B. Li and Hubert P. H. Shum, "From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos," in ICPR '24: Proceedings of the 2024 International Conference on Pattern Recognition, Kolkata, India, 2024.

BibTeX

@inproceedings{qiao24from,
 author={Qiao, Tanqiu and Li, Ruochen and Li, Frederick W. B. and Shum, Hubert P. H.},
 booktitle={Proceedings of the 2024 International Conference on Pattern Recognition},
 series={ICPR '24},
 title={From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos},
 year={2024},
 location={Kolkata, India},
}

RIS

TY  - CONF
AU  - Qiao, Tanqiu
AU  - Li, Ruochen
AU  - Li, Frederick W. B.
AU  - Shum, Hubert P. H.
T2  - Proceedings of the 2024 International Conference on Pattern Recognition
TI  - From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos
PY  - 2024
ER  - 


Supporting Grants


Similar Research

Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima and Hubert P. H. Shum, "Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos", Proceedings of the 2022 European Conference on Computer Vision (ECCV), 2022
Manli Zhu, Edmond S. L. Ho and Hubert P. H. Shum, "A Skeleton-Aware Graph Convolutional Network for Human-Object Interaction Detection", Proceedings of the 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2022
Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang and Hubert P. H. Shum, "Geometric Features Enhanced Human-Object Interaction Detection", IEEE Transactions on Instrumentation and Measurement (TIM), 2024
Qianhui Men, Howard Leung, Edmond S. L. Ho and Hubert P. H. Shum, "A Two-Stream Recurrent Network for Skeleton-Based Human Interaction Recognition", Proceedings of the 2020 International Conference on Pattern Recognition (ICPR), 2020

 

Last updated on 7 September 2024
RSS Feed