Saved in:
Bibliographic Details
Main Authors: Fang, Siyan, Wang, Yuntao, Zhang, Jinpu, Li, Ziwen, Wang, Yuehuan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.06200
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914375195951104
author Fang, Siyan
Wang, Yuntao
Zhang, Jinpu
Li, Ziwen
Wang, Yuehuan
author_facet Fang, Siyan
Wang, Yuntao
Zhang, Jinpu
Li, Ziwen
Wang, Yuehuan
contents Existing image reflection removal methods struggle to handle complex reflections. Accurate language descriptions can help the model understand the image content to remove complex reflections. However, due to blurred and distorted interferences in reflected images, machine-generated language descriptions of the image content are often inaccurate, which harms the performance of language-guided reflection removal. To address this, we propose the Adaptive Language-Aware Network (ALANet) to remove reflections even with inaccurate language inputs. Specifically, ALANet integrates both filtering and optimization strategies. The filtering strategy reduces the negative effects of language while preserving its benefits, whereas the optimization strategy enhances the alignment between language and visual features. ALANet also utilizes language cues to decouple specific layer content from feature maps, improving its ability to handle complex reflections. To evaluate the model's performance under complex reflections and varying levels of language accuracy, we introduce the Complex Reflection and Language Accuracy Variance (CRLAV) dataset. Experimental results demonstrate that ALANet surpasses state-of-the-art methods for image reflection removal. The code and dataset are available at https://github.com/fashyon/ALANet.
format Preprint
id arxiv_https___arxiv_org_abs_2603_06200
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Adaptive Language-Aware Image Reflection Removal Network
Fang, Siyan
Wang, Yuntao
Zhang, Jinpu
Li, Ziwen
Wang, Yuehuan
Computer Vision and Pattern Recognition
Existing image reflection removal methods struggle to handle complex reflections. Accurate language descriptions can help the model understand the image content to remove complex reflections. However, due to blurred and distorted interferences in reflected images, machine-generated language descriptions of the image content are often inaccurate, which harms the performance of language-guided reflection removal. To address this, we propose the Adaptive Language-Aware Network (ALANet) to remove reflections even with inaccurate language inputs. Specifically, ALANet integrates both filtering and optimization strategies. The filtering strategy reduces the negative effects of language while preserving its benefits, whereas the optimization strategy enhances the alignment between language and visual features. ALANet also utilizes language cues to decouple specific layer content from feature maps, improving its ability to handle complex reflections. To evaluate the model's performance under complex reflections and varying levels of language accuracy, we introduce the Complex Reflection and Language Accuracy Variance (CRLAV) dataset. Experimental results demonstrate that ALANet surpasses state-of-the-art methods for image reflection removal. The code and dataset are available at https://github.com/fashyon/ALANet.
title Adaptive Language-Aware Image Reflection Removal Network
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2603.06200