Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Fan, Xiaotian, Zhou, Xingyu, Liang, Le, Li, Xiao, Jin, Shi
Formato:	Preprint
Publicado:	2026
Materias:	Machine Learning Information Theory
Acceso en línea:	https://arxiv.org/abs/2603.13440
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866908885389934592
author	Fan, Xiaotian Zhou, Xingyu Liang, Le Li, Xiao Jin, Shi
author_facet	Fan, Xiaotian Zhou, Xingyu Liang, Le Li, Xiao Jin, Shi
contents	Deep generative models offer a powerful alternative to conventional channel estimation by learning complex channel distributions. By integrating the rich environmental information available in modern sensing-aided networks, this paper proposes MultiCE-Flow, a multimodal channel estimation framework based on flow matching and diffusion transformer (DiT). We design a specialized multimodal perception module that fuses LiDAR, camera, and location data into a semantic condition, while treating sparse pilots as a structural condition. These conditions guide a DiT backbone to reconstruct high-fidelity channels. Unlike standard diffusion models, we employ flow matching to learn a linear trajectory from noise to data, enabling efficient one-step sampling. By leveraging environmental semantics, our method mitigates the ill-posed nature of estimation with sparse pilots. Extensive experiments demonstrate that MultiCE-Flow consistently outperforms traditional baselines and existing generative models. Notably, it exhibits superior robustness to out-of-distribution scenarios and varying pilot densities, making it suitable for environment-aware communication systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_13440
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching Fan, Xiaotian Zhou, Xingyu Liang, Le Li, Xiao Jin, Shi Machine Learning Information Theory Deep generative models offer a powerful alternative to conventional channel estimation by learning complex channel distributions. By integrating the rich environmental information available in modern sensing-aided networks, this paper proposes MultiCE-Flow, a multimodal channel estimation framework based on flow matching and diffusion transformer (DiT). We design a specialized multimodal perception module that fuses LiDAR, camera, and location data into a semantic condition, while treating sparse pilots as a structural condition. These conditions guide a DiT backbone to reconstruct high-fidelity channels. Unlike standard diffusion models, we employ flow matching to learn a linear trajectory from noise to data, enabling efficient one-step sampling. By leveraging environmental semantics, our method mitigates the ill-posed nature of estimation with sparse pilots. Extensive experiments demonstrate that MultiCE-Flow consistently outperforms traditional baselines and existing generative models. Notably, it exhibits superior robustness to out-of-distribution scenarios and varying pilot densities, making it suitable for environment-aware communication systems.
title	Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching
topic	Machine Learning Information Theory
url	https://arxiv.org/abs/2603.13440

Ejemplares similares