PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction

Kanglei Zhou, Hubert P. H. Shum, Frederick W. B. Li, Xingxing Zhang and Xiaohui Liang
IEEE Transactions on Image Processing (TIP), 2025

Impact Factor: 13.7Top 10% Journal in Computer Science, Artificial Intelligence

PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction

Abstract

Long-term Action Quality Assessment (AQA) aims to evaluate the quantitative performance of actions in long videos. However, existing methods face challenges due to domain shifts between the pre-trained large-scale action recognition backbones and the specific AQA task, hindering performance. This arises since fine-tuning intensive backbones on small AQA datasets is impractical. We address this by distinguishing domain shifts into task-level, regarding differences in task objectives, and feature-level, regarding differences in important features. For feature-level shifts, which are more detrimental, we propose Progressive Hierarchical Instruction (PHI) with two strategies. First, Gap Minimization Flow (GMF) leverages flow matching to progressively learn a fast flow path that reduces the domain gap between initial and desired features across shallow to deep layers. Additionally, a temporally-enhanced attention module captures long-range dependencies essential for AQA. Second, List-wise Contrastive Regularization (LCR) facilitates coarse-to-fine alignment by comprehensively comparing batch pairs to learn fine-grained cues while mitigating shift. Integrating these, PHI offers an effective solution. Experiments demonstrate that PHI achieves state-of-the-art performance on three representative long-term AQA datasets, proving its superiority in addressing the domain shift issue for long-term AQA. \footnote{Our code is included in the supplementary material for examination.


Downloads


YouTube


Cite This Research

Plain Text

Kanglei Zhou, Hubert P. H. Shum, Frederick W. B. Li, Xingxing Zhang and Xiaohui Liang, "PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction," IEEE Transactions on Image Processing, vol. 34, pp. 3718-3732, IEEE, 2025.

BibTeX

@article{zhou25phi,
 author={Zhou, Kanglei and Shum, Hubert P. H. and Li, Frederick W. B. and Zhang, Xingxing and Liang, Xiaohui},
 journal={IEEE Transactions on Image Processing},
 title={PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction},
 year={2025},
 volume={34},
 pages={3718--3732},
 numpages={15},
 doi={10.1109/TIP.2025.3574938},
 issn={1057-7149},
 publisher={IEEE},
}

RIS

TY  - JOUR
AU  - Zhou, Kanglei
AU  - Shum, Hubert P. H.
AU  - Li, Frederick W. B.
AU  - Zhang, Xingxing
AU  - Liang, Xiaohui
T2  - IEEE Transactions on Image Processing
TI  - PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction
PY  - 2025
VL  - 34
SP  - 3718
EP  - 3732
DO  - 10.1109/TIP.2025.3574938
SN  - 1057-7149
PB  - IEEE
ER  - 


Supporting Grants


Similar Research

Kanglei Zhou, Yue Ma, Hubert P. H. Shum and Xiaohui Liang, "Hierarchical Graph Convolutional Networks for Action Quality Assessment", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023
Ruisheng Han, Kanglei Zhou, Amir Atapour-Abarghouei, Xiaohui Liang and Hubert P. H. Shum, "FineCausal: A Causal-Based Framework for Interpretable Fine-Grained Action Quality Assessment", Proceedings of the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025
Kanglei Zhou, Liyuan Wang, Xingxing Zhang, Hubert P. H. Shum, Frederick W. B. Li, Jianguo Li and Xiaohui Liang, "MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment", Proceedings of the 2024 European Conference on Computer Vision (ECCV), 2024
Kanglei Zhou, Ruizhi Cai, Liyuan Wang, Hubert P. H. Shum and Xiaohui Liang, "A Comprehensive Survey of Action Quality Assessment: Method and Benchmark", arXiv Preprint, 2024
Kanglei Zhou, Ruizhi Cai, Yue Ma, Qingqing Tan, Xinning Wang, Jianguo Li, Hubert P. H. Shum, Frederick W. B. Li, Song Jin and Xiaohui Liang, "A Video-Based Augmented Reality System for Human-in-the-Loop Muscle Strength Assessment of Juvenile Dermatomyositis", IEEE Transactions on Visualization and Computer Graphics (TVCG) - Proceedings of the 2023 IEEE Conference on Virtual Reality and 3D User Interfaces (VR), 2023

HomeGoogle ScholarYouTubeLinkedInTwitter/XGitHubORCIDResearchGateEmail
 
Print