Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Lin, Wei-Tung, Lin, Yong-Xiang, Chen, Jyun-Wei, Hua, Kai-Lung
Formato:	Preprint
Publicado:	2024
Materias:	Computer Vision and Pattern Recognition
Acceso en línea:	https://arxiv.org/abs/2406.08444
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866914833093361664
author	Lin, Wei-Tung Lin, Yong-Xiang Chen, Jyun-Wei Hua, Kai-Lung
author_facet	Lin, Wei-Tung Lin, Yong-Xiang Chen, Jyun-Wei Hua, Kai-Lung
contents	Underwater Image Enhancement (UIE) is critical for marine research and exploration but hindered by complex color distortions and severe blurring. Recent deep learning-based methods have achieved remarkable results, yet these methods struggle with high computational costs and insufficient global modeling, resulting in locally under- or over- adjusted regions. We present PixMamba, a novel architecture, designed to overcome these challenges by leveraging State Space Models (SSMs) for efficient global dependency modeling. Unlike convolutional neural networks (CNNs) with limited receptive fields and transformer networks with high computational costs, PixMamba efficiently captures global contextual information while maintaining computational efficiency. Our dual-level strategy features the patch-level Efficient Mamba Net (EMNet) for reconstructing enhanced image feature and the pixel-level PixMamba Net (PixNet) to ensure fine-grained feature capturing and global consistency of enhanced image that were previously difficult to obtain. PixMamba achieves state-of-the-art performance across various underwater image datasets and delivers visually superior results. Code is available at: https://github.com/weitunglin/pixmamba.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_08444
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement Lin, Wei-Tung Lin, Yong-Xiang Chen, Jyun-Wei Hua, Kai-Lung Computer Vision and Pattern Recognition Underwater Image Enhancement (UIE) is critical for marine research and exploration but hindered by complex color distortions and severe blurring. Recent deep learning-based methods have achieved remarkable results, yet these methods struggle with high computational costs and insufficient global modeling, resulting in locally under- or over- adjusted regions. We present PixMamba, a novel architecture, designed to overcome these challenges by leveraging State Space Models (SSMs) for efficient global dependency modeling. Unlike convolutional neural networks (CNNs) with limited receptive fields and transformer networks with high computational costs, PixMamba efficiently captures global contextual information while maintaining computational efficiency. Our dual-level strategy features the patch-level Efficient Mamba Net (EMNet) for reconstructing enhanced image feature and the pixel-level PixMamba Net (PixNet) to ensure fine-grained feature capturing and global consistency of enhanced image that were previously difficult to obtain. PixMamba achieves state-of-the-art performance across various underwater image datasets and delivers visually superior results. Code is available at: https://github.com/weitunglin/pixmamba.
title	PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2406.08444

Ejemplares similares