Cross-view Multimodal Vision-Based Assessment Framework for Traditional Chinese Medicine Rehabilitation Training

Francis Xiatian Zhang, Hao Yao, Shengxuan Chen, Hong Zhu, Hongxiao Jia, Sisi Zheng and Hubert P. H. Shum
IEEE Transactions on Neural Systems and Rehabilitation Engineering (TNSRE), 2026

Impact Factor: 5.2Top 25% Journal in Engineering, Biomedical

Cross-view Multimodal Vision-Based Assessment Framework for Traditional Chinese Medicine Rehabilitation Training

Abstract

Vision-based assessment can provide convenient and cost-effective evaluation in Traditional Chinese Medicine (TCM) rehabilitation training, where action quality assessment (AQA) from computer vision offers a promising solution. Existing automatic AQA frameworks for physical therapy typically rely on skeletal data captured from a single viewpoint, which is inefficient for TCM techniques such as acupuncture or Tuina that involve dense hand self-occlusion and complex hand–object interactions. To address these challenges, we propose CME-AQA, a cross-view, multimodal vision-based assessment framework that integrates visual–pose fusion to enhance understanding of environmental context and leverages both first-person and third-person videos during training to improve inference robustness. We collected two dual-view datasets, TCM-AQA61-A (Acupuncture) and TCM-AQA61-T (Tuina), each containing synchronized first- and third-person recordings of 61 subjects with expert annotations. Experimental results show that our approach achieves superior or comparable mean performance against competitive baselines, achieving over 10% relative improvement in weighted F1 over the best competing method on key rating tasks such as Needle Depth and Quick Needle Insertion, while also reducing mean absolute error in quantitative measures such as insertion time and manipulation frequency. Testing on a CPR dataset further demonstrates comparable performance on several posture-based criteria, suggesting applicability to related structured simulated clinical skill assessments where participant motion is central to evaluation. Overall, CME-AQA enhances assessment accuracy for structured TCM rehabilitation training and facilitates more convenient and effective training-oriented skill evaluation.


Downloads


YouTube


Cite This Research

Plain Text

Francis Xiatian Zhang, Hao Yao, Shengxuan Chen, Hong Zhu, Hongxiao Jia, Sisi Zheng and Hubert P. H. Shum, "Cross-view Multimodal Vision-Based Assessment Framework for Traditional Chinese Medicine Rehabilitation Training," IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2026.

BibTeX

@article{zhang26cross,
 author={Zhang, Francis Xiatian and Yao, Hao and Chen, Shengxuan and Zhu, Hong and Jia, Hongxiao and Zheng, Sisi and Shum, Hubert P. H.},
 journal={IEEE Transactions on Neural Systems and Rehabilitation Engineering},
 title={Cross-view Multimodal Vision-Based Assessment Framework for Traditional Chinese Medicine Rehabilitation Training},
 year={2026},
}

RIS

TY  - JOUR
AU  - Zhang, Francis Xiatian
AU  - Yao, Hao
AU  - Chen, Shengxuan
AU  - Zhu, Hong
AU  - Jia, Hongxiao
AU  - Zheng, Sisi
AU  - Shum, Hubert P. H.
T2  - IEEE Transactions on Neural Systems and Rehabilitation Engineering
TI  - Cross-view Multimodal Vision-Based Assessment Framework for Traditional Chinese Medicine Rehabilitation Training
PY  - 2026
ER  - 


Supporting Grants


Similar Research


HomeGoogle ScholarLinkedInYouTubeGitHubORCIDResearchGateEmail
 
Print