Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yin, Xinyi, Wang, Yiduo, Hu, Tingqi, Si, Meicong, Shi, Yunyun, Chen, Shi, Wang, Hao, Xue, Junxiao, Wu, Xuecheng
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2605.02521
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913121801601024
author	Yin, Xinyi Wang, Yiduo Hu, Tingqi Si, Meicong Shi, Yunyun Chen, Shi Wang, Hao Xue, Junxiao Wu, Xuecheng
author_facet	Yin, Xinyi Wang, Yiduo Hu, Tingqi Si, Meicong Shi, Yunyun Chen, Shi Wang, Hao Xue, Junxiao Wu, Xuecheng
contents	Affective Image Editing (AIE) aims to modify visual content to evoke targeted emotions. Although current approaches achieve impressive editing quality, they often overlook inference efficiency, which limits their applicability in computational social scenarios. Moreover, most methods depend on discrete emotion representations, which hinder the continuous modeling of complex human emotions and constrain expressive capabilities in interactive scenarios. To tackle these gaps, we propose MooD, the first framework that directly leverages continuous Valence-Arousal (VA) values as editing instruction for fine-grained and efficient AIE in computational social systems. Specifically, we first introduce a VA-Aware retrieval strategy to bridge vague affective values and detailed visual semantics. Building upon this, MooD integrates visual transfer and perception-enhanced semantic guidance to achieve controllable AIE. Furthermore, considering that existing VA-annotated datasets mainly focus on social scenarios and largely overlook natural scenes, we therefore construct AffectSet, a comprehensive VA-annotated dataset covering diverse scenarios, to support model optimization and evaluation. Extensive qualitative and quantitative experimental results demonstrate that our MooD achieves superior performance in both affective controllability and visual fidelity while maintaining high efficiency. A series of ablation studies further reveal the crucial factors of our design.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_02521
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	MooD: Perception-Enhanced Efficient Affective Image Editing via Continuous Valence-Arousal Modeling Yin, Xinyi Wang, Yiduo Hu, Tingqi Si, Meicong Shi, Yunyun Chen, Shi Wang, Hao Xue, Junxiao Wu, Xuecheng Computer Vision and Pattern Recognition Affective Image Editing (AIE) aims to modify visual content to evoke targeted emotions. Although current approaches achieve impressive editing quality, they often overlook inference efficiency, which limits their applicability in computational social scenarios. Moreover, most methods depend on discrete emotion representations, which hinder the continuous modeling of complex human emotions and constrain expressive capabilities in interactive scenarios. To tackle these gaps, we propose MooD, the first framework that directly leverages continuous Valence-Arousal (VA) values as editing instruction for fine-grained and efficient AIE in computational social systems. Specifically, we first introduce a VA-Aware retrieval strategy to bridge vague affective values and detailed visual semantics. Building upon this, MooD integrates visual transfer and perception-enhanced semantic guidance to achieve controllable AIE. Furthermore, considering that existing VA-annotated datasets mainly focus on social scenarios and largely overlook natural scenes, we therefore construct AffectSet, a comprehensive VA-annotated dataset covering diverse scenarios, to support model optimization and evaluation. Extensive qualitative and quantitative experimental results demonstrate that our MooD achieves superior performance in both affective controllability and visual fidelity while maintaining high efficiency. A series of ablation studies further reveal the crucial factors of our design.
title	MooD: Perception-Enhanced Efficient Affective Image Editing via Continuous Valence-Arousal Modeling
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2605.02521

Similar Items