Saved in:
Bibliographic Details
Main Authors: Yin, Xinyi, Wang, Yiduo, Hu, Tingqi, Si, Meicong, Shi, Yunyun, Chen, Shi, Wang, Hao, Xue, Junxiao, Wu, Xuecheng
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.02521
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913121801601024
author Yin, Xinyi
Wang, Yiduo
Hu, Tingqi
Si, Meicong
Shi, Yunyun
Chen, Shi
Wang, Hao
Xue, Junxiao
Wu, Xuecheng
author_facet Yin, Xinyi
Wang, Yiduo
Hu, Tingqi
Si, Meicong
Shi, Yunyun
Chen, Shi
Wang, Hao
Xue, Junxiao
Wu, Xuecheng
contents Affective Image Editing (AIE) aims to modify visual content to evoke targeted emotions. Although current approaches achieve impressive editing quality, they often overlook inference efficiency, which limits their applicability in computational social scenarios. Moreover, most methods depend on discrete emotion representations, which hinder the continuous modeling of complex human emotions and constrain expressive capabilities in interactive scenarios. To tackle these gaps, we propose MooD, the first framework that directly leverages continuous Valence-Arousal (VA) values as editing instruction for fine-grained and efficient AIE in computational social systems. Specifically, we first introduce a VA-Aware retrieval strategy to bridge vague affective values and detailed visual semantics. Building upon this, MooD integrates visual transfer and perception-enhanced semantic guidance to achieve controllable AIE. Furthermore, considering that existing VA-annotated datasets mainly focus on social scenarios and largely overlook natural scenes, we therefore construct AffectSet, a comprehensive VA-annotated dataset covering diverse scenarios, to support model optimization and evaluation. Extensive qualitative and quantitative experimental results demonstrate that our MooD achieves superior performance in both affective controllability and visual fidelity while maintaining high efficiency. A series of ablation studies further reveal the crucial factors of our design.
format Preprint
id arxiv_https___arxiv_org_abs_2605_02521
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle MooD: Perception-Enhanced Efficient Affective Image Editing via Continuous Valence-Arousal Modeling
Yin, Xinyi
Wang, Yiduo
Hu, Tingqi
Si, Meicong
Shi, Yunyun
Chen, Shi
Wang, Hao
Xue, Junxiao
Wu, Xuecheng
Computer Vision and Pattern Recognition
Affective Image Editing (AIE) aims to modify visual content to evoke targeted emotions. Although current approaches achieve impressive editing quality, they often overlook inference efficiency, which limits their applicability in computational social scenarios. Moreover, most methods depend on discrete emotion representations, which hinder the continuous modeling of complex human emotions and constrain expressive capabilities in interactive scenarios. To tackle these gaps, we propose MooD, the first framework that directly leverages continuous Valence-Arousal (VA) values as editing instruction for fine-grained and efficient AIE in computational social systems. Specifically, we first introduce a VA-Aware retrieval strategy to bridge vague affective values and detailed visual semantics. Building upon this, MooD integrates visual transfer and perception-enhanced semantic guidance to achieve controllable AIE. Furthermore, considering that existing VA-annotated datasets mainly focus on social scenarios and largely overlook natural scenes, we therefore construct AffectSet, a comprehensive VA-annotated dataset covering diverse scenarios, to support model optimization and evaluation. Extensive qualitative and quantitative experimental results demonstrate that our MooD achieves superior performance in both affective controllability and visual fidelity while maintaining high efficiency. A series of ablation studies further reveal the crucial factors of our design.
title MooD: Perception-Enhanced Efficient Affective Image Editing via Continuous Valence-Arousal Modeling
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2605.02521