Resolving Hand-Object Occlusion for Mixed Reality with Joint Deep Learning and Model Optimization

Qi Feng, Hubert P. H. Shum and Shigeo Morishima
Computer Animation and Virtual Worlds (CAVW) - Proceedings of the 2020 International Conference on Computer Animation and Social Agents (CASA), 2020

Resolving Hand-Object Occlusion for Mixed Reality with Joint Deep Learning and Model Optimization

Abstract

By overlaying virtual imagery onto the real world, mixed reality facilitates diverse applications and has drawn increasing attention. Enhancing physical in-hand objects with a virtual appearance is a key component for many applications that require users to interact with tools such as surgery simulations. However, due to complex hand articulations and severe hand-object occlusions, resolving occlusions in hand-object interactions is a challenging topic. Traditional tracking-based approaches are limited by strong ambiguities from occlusions and changing shapes, while reconstruction-based methods show poor capability of handling dynamic scenes. In this paper, we propose a novel real-time optimization system to resolve hand-object occlusions by spatially reconstructing the scene with estimated hand joints and masks. To acquire accurate results, we propose a joint learning process that shares information between two models and jointly estimates hand poses and semantic segmentation. To facilitate the joint learning system and improve its accuracy under occlusions, we propose an occlusion-aware RGB-D hand dataset that mitigates the ambiguity through precise annotations and photorealistic appearance. Evaluations show more consistent overlays compared to literature, and a user study verifies a more realistic experience.


Downloads


YouTube




Cite This Research

Plain Text

Qi Feng, Hubert P. H. Shum and Shigeo Morishima, "Resolving Hand-Object Occlusion for Mixed Reality with Joint Deep Learning and Model Optimization," Computer Animation and Virtual Worlds, vol. 31, no. 4--5, pp. e1956, John Wiley and Sons Ltd., 2020.

BibTeX

@article{feng20resolving,
 author={Feng, Qi and Shum, Hubert P. H. and Morishima, Shigeo},
 journal={Computer Animation and Virtual Worlds},
 series={CASA '24},
 title={Resolving Hand-Object Occlusion for Mixed Reality with Joint Deep Learning and Model Optimization},
 year={2020},
 volume={31},
 number={4--5},
 pages={e1956},
 numpages={12},
 doi={10.1002/cav.1956},
 publisher={John Wiley and Sons Ltd.},
 Address={Chichester, UK},
}

RIS

TY  - JOUR
AU  - Feng, Qi
AU  - Shum, Hubert P. H.
AU  - Morishima, Shigeo
T2  - Computer Animation and Virtual Worlds
TI  - Resolving Hand-Object Occlusion for Mixed Reality with Joint Deep Learning and Model Optimization
PY  - 2020
VL  - 31
IS  - 4--5
SP  - e1956
EP  - e1956
DO  - 10.1002/cav.1956
PB  - John Wiley and Sons Ltd.
ER  - 


Supporting Grants


Similar Research

Qi Feng, Hubert P. H. Shum and Shigeo Morishima, "Resolving Occlusion for 3D Object Manipulation with Hands in Mixed Reality", Proceedings of the 2018 ACM Symposium on Virtual Reality Software and Technology (VRST) Posters, 2018
Qi Feng, Hubert P. H. Shum and Shigeo Morishima, "Occlusion for 3D Object Manipulation with Hands in Augmented Reality", Proceedings of the 2018 Meeting on Image Recognition and Understanding (MIRU), 2018
Kanglei Zhou, Chen Chen, Yue Ma, Zhiying Leng, Hubert P. H. Shum, Frederick W. B. Li and Xiaohui Liang, "A Mixed Reality Training System for Hand-Object Interaction in Simulated Microgravity Environments", Proceedings of the 2023 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2023
Kanglei Zhou, Jiaying Chen, Hubert P. H. Shum, Frederick W. B. Li and Xiaohui Liang, "STGAE: Spatial Temporal Graph Auto-Encoder for Hand Motion Denoising", Proceedings of the 2021 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2021
Zhiying Leng, Jiaying Chen, Hubert P. H. Shum, Frederick W. B. Li and Xiaohui Liang, "Stable Hand Pose Estimation under Tremor via Graph Neural Network", Proceedings of the 2021 IEEE Conference on Virtual Reality and 3D User Interfaces (VR), 2021

 

Last updated on 7 September 2024
RSS Feed