Saved in:
Bibliographic Details
Main Authors: Zhou, Kanglei, Li, Chang, Pan, Qingyi, Wang, Liyuan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.19170
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908847365423104
author Zhou, Kanglei
Li, Chang
Pan, Qingyi
Wang, Liyuan
author_facet Zhou, Kanglei
Li, Chang
Pan, Qingyi
Wang, Liyuan
contents Action Quality Assessment (AQA) aims to score how well an action is performed and is widely used in sports analysis, rehabilitation assessment, and human skill evaluation. Multi-modal AQA has recently achieved strong progress by leveraging complementary visual and kinematic cues, yet real-world deployments often suffer from non-stationary modality imbalance, where certain modalities become missing or intermittently available due to sensor failures or annotation gaps. Existing continual AQA methods overlook this issue and assume that all modalities remain complete and stable throughout training, which restricts their practicality. To address this challenge, we introduce Bridged Modality Adaptation (BriMA), an innovative approach to multi-modal continual AQA under modality-missing conditions. BriMA consists of a memory-guided bridging imputation module that reconstructs missing modalities using both task-agnostic and task-specific representations, and a modality-aware replay mechanism that prioritizes informative samples based on modality distortion and distribution drift. Experiments on three representative multi-modal AQA datasets (RG, Fis-V, and FS1000) show that BriMA consistently improves performance under different modality-missing conditions, achieving 6--8\% higher correlation and 12--15\% lower error on average. These results demonstrate a step toward robust multi-modal AQA systems under real-world deployment constraints.
format Preprint
id arxiv_https___arxiv_org_abs_2602_19170
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle BriMA: Bridged Modality Adaptation for Multi-Modal Continual Action Quality Assessment
Zhou, Kanglei
Li, Chang
Pan, Qingyi
Wang, Liyuan
Computer Vision and Pattern Recognition
Action Quality Assessment (AQA) aims to score how well an action is performed and is widely used in sports analysis, rehabilitation assessment, and human skill evaluation. Multi-modal AQA has recently achieved strong progress by leveraging complementary visual and kinematic cues, yet real-world deployments often suffer from non-stationary modality imbalance, where certain modalities become missing or intermittently available due to sensor failures or annotation gaps. Existing continual AQA methods overlook this issue and assume that all modalities remain complete and stable throughout training, which restricts their practicality. To address this challenge, we introduce Bridged Modality Adaptation (BriMA), an innovative approach to multi-modal continual AQA under modality-missing conditions. BriMA consists of a memory-guided bridging imputation module that reconstructs missing modalities using both task-agnostic and task-specific representations, and a modality-aware replay mechanism that prioritizes informative samples based on modality distortion and distribution drift. Experiments on three representative multi-modal AQA datasets (RG, Fis-V, and FS1000) show that BriMA consistently improves performance under different modality-missing conditions, achieving 6--8\% higher correlation and 12--15\% lower error on average. These results demonstrate a step toward robust multi-modal AQA systems under real-world deployment constraints.
title BriMA: Bridged Modality Adaptation for Multi-Modal Continual Action Quality Assessment
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.19170