Saved in:
Bibliographic Details
Main Authors: Ma, Chuang, Zhao, Shaokai, Zhou, Dongdong, Pei, Yu, Luo, Zhiguo, Xie, Liang, Yan, Ye, Yin, Erwei
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.09735
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909646053179392
author Ma, Chuang
Zhao, Shaokai
Zhou, Dongdong
Pei, Yu
Luo, Zhiguo
Xie, Liang
Yan, Ye
Yin, Erwei
author_facet Ma, Chuang
Zhao, Shaokai
Zhou, Dongdong
Pei, Yu
Luo, Zhiguo
Xie, Liang
Yan, Ye
Yin, Erwei
contents Micro-expression recognition (MER), a critical subfield of affective computing, presents greater challenges than macro-expression recognition due to its brief duration and low intensity. While incorporating prior knowledge has been shown to enhance MER performance, existing methods predominantly rely on simplistic, singular sources of prior knowledge, failing to fully exploit multi-source information. This paper introduces the Multi-Prior Fusion Network (MPFNet), leveraging a progressive training strategy to optimize MER tasks. We propose two complementary encoders: the Generic Feature Encoder (GFE) and the Advanced Feature Encoder (AFE), both based on Inflated 3D ConvNets (I3D) with Coordinate Attention (CA) mechanisms, to improve the model's ability to capture spatiotemporal and channel-specific features. Inspired by developmental psychology, we present two variants of MPFNet--MPFNet-P and MPFNet-C--corresponding to two fundamental modes of infant cognitive development: parallel and hierarchical processing. These variants enable the evaluation of different strategies for integrating prior knowledge. Extensive experiments demonstrate that MPFNet significantly improves MER accuracy while maintaining balanced performance across categories, achieving accuracies of 0.811, 0.924, and 0.857 on the SMIC, CASME II, and SAMM datasets, respectively. To the best of our knowledge, our approach achieves state-of-the-art performance on the SMIC and SAMM datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2506_09735
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle MPFNet: A Multi-Prior Fusion Network with a Progressive Training Strategy for Micro-Expression Recognition
Ma, Chuang
Zhao, Shaokai
Zhou, Dongdong
Pei, Yu
Luo, Zhiguo
Xie, Liang
Yan, Ye
Yin, Erwei
Computer Vision and Pattern Recognition
Micro-expression recognition (MER), a critical subfield of affective computing, presents greater challenges than macro-expression recognition due to its brief duration and low intensity. While incorporating prior knowledge has been shown to enhance MER performance, existing methods predominantly rely on simplistic, singular sources of prior knowledge, failing to fully exploit multi-source information. This paper introduces the Multi-Prior Fusion Network (MPFNet), leveraging a progressive training strategy to optimize MER tasks. We propose two complementary encoders: the Generic Feature Encoder (GFE) and the Advanced Feature Encoder (AFE), both based on Inflated 3D ConvNets (I3D) with Coordinate Attention (CA) mechanisms, to improve the model's ability to capture spatiotemporal and channel-specific features. Inspired by developmental psychology, we present two variants of MPFNet--MPFNet-P and MPFNet-C--corresponding to two fundamental modes of infant cognitive development: parallel and hierarchical processing. These variants enable the evaluation of different strategies for integrating prior knowledge. Extensive experiments demonstrate that MPFNet significantly improves MER accuracy while maintaining balanced performance across categories, achieving accuracies of 0.811, 0.924, and 0.857 on the SMIC, CASME II, and SAMM datasets, respectively. To the best of our knowledge, our approach achieves state-of-the-art performance on the SMIC and SAMM datasets.
title MPFNet: A Multi-Prior Fusion Network with a Progressive Training Strategy for Micro-Expression Recognition
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2506.09735