HOMEto discover JOINING USto achieve PUBLICATIONSto innovate GRANTSto establish ACTIVITIESto engage PEOPLEto collaborate TEACHINGto inspire CONTACTSto explore

Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos

Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima and Hubert P. H. Shum
Proceedings of the 2022 European Conference on Computer Vision (ECCV), 2022

H5-Index: 262^# Core A* Conference^‡ Citation: 35^#

Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos

‡ According to ICORE Ranking 2026

# According to Google Scholar 2026

Abstract

Human-Object Interaction (HOI) recognition in videos is important for analysing human activity. Most existing work focusing on visual features usually suffer from occlusion in the real-world scenarios. Such a problem will be further complicated when multiple people and objects are involved in HOIs. Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN). The geometric-level graph models the interdependency between geometric features of humans and objects, while the fusion-level graph further fuses them with visual features of humans and objects. To demonstrate the novelty and effectiveness of our method in challenging scenarios, we propose a new multi-person HOI dataset (MPHOI-72). Extensive experiments on MPHOI-72 (multi-person HOI), CAD-120 (single-human HOI) and Bimanual Actions (two-hand HOI) datasets demonstrate our superior performance compared to state-of-the-arts.

Downloads

Paper (1.6MB)

Supplementary Material (0.2MB)

GitHub

Dataset

arXiv

DOI - Publisher's Page

Cite This Research

Plain Text

Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima and Hubert P. H. Shum, "Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos," in ECCV '22: Proceedings of the 2022 European Conference on Computer Vision, pp. 474-491, Tel Aviv, Israel, Springer, Oct 2022.

BibTeX

@inproceedings{qiao22geometric,
author={Qiao, Tanqiu and Men, Qianhui and Li, Frederick W. B. and Kubotani, Yoshiki and Morishima, Shigeo and Shum, Hubert P. H.},
booktitle={Proceedings of the 2022 European Conference on Computer Vision},
series={ECCV '22},
title={Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos},
year={2022},
month={10},
pages={474--491},
numpages={18},
doi={10.1007/978-3-031-19772-7_28},
isbn={978-3-031-19772-7},
publisher={Springer},
location={Tel Aviv, Israel},
}

RIS

TY  - CONF
AU  - Qiao, Tanqiu
AU  - Men, Qianhui
AU  - Li, Frederick W. B.
AU  - Kubotani, Yoshiki
AU  - Morishima, Shigeo
AU  - Shum, Hubert P. H.
T2  - Proceedings of the 2022 European Conference on Computer Vision
TI  - Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos
PY  - 2022
Y1  - 10 2022
SP  - 474
EP  - 491
DO  - 10.1007/978-3-031-19772-7_28
SN  - 978-3-031-19772-7
PB  - Springer
ER  -

Similar Research

Tanqiu Qiao, Ruochen Li, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima and Hubert P. H. Shum, "Geometric Visual Fusion Graph Neural Networks for Multi-Person Human-Object Interaction Recognition in Videos", Expert Systems with Applications (ESWA), 2025

Tanqiu Qiao, Ruochen Li, Frederick W. B. Li and Hubert P. H. Shum, "From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos", Proceedings of the 2024 International Conference on Pattern Recognition (ICPR), 2024

Manli Zhu, Edmond S. L. Ho and Hubert P. H. Shum, "A Skeleton-Aware Graph Convolutional Network for Human-Object Interaction Detection", Proceedings of the 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2022

Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang and Hubert P. H. Shum, "Geometric Features Enhanced Human-Object Interaction Detection", IEEE Transactions on Instrumentation and Measurement (TIM), 2024

Qianhui Men, Howard Leung, Edmond S. L. Ho and Hubert P. H. Shum, "A Two-Stream Recurrent Network for Skeleton-Based Human Interaction Recognition", Proceedings of the 2020 International Conference on Pattern Recognition (ICPR), 2020

HomeGoogle ScholarLinkedInYouTubeGitHubORCIDResearchGateEmail

Last updated on 26 June 2026
RSS Feed