Guardado en:
Detalles Bibliográficos
Autores principales: Fan, Xiaotian, Zhou, Xingyu, Liang, Le, Li, Xiao, Jin, Shi
Formato: Preprint
Publicado: 2026
Materias:
Acceso en línea:https://arxiv.org/abs/2603.13440
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866908885389934592
author Fan, Xiaotian
Zhou, Xingyu
Liang, Le
Li, Xiao
Jin, Shi
author_facet Fan, Xiaotian
Zhou, Xingyu
Liang, Le
Li, Xiao
Jin, Shi
contents Deep generative models offer a powerful alternative to conventional channel estimation by learning complex channel distributions. By integrating the rich environmental information available in modern sensing-aided networks, this paper proposes MultiCE-Flow, a multimodal channel estimation framework based on flow matching and diffusion transformer (DiT). We design a specialized multimodal perception module that fuses LiDAR, camera, and location data into a semantic condition, while treating sparse pilots as a structural condition. These conditions guide a DiT backbone to reconstruct high-fidelity channels. Unlike standard diffusion models, we employ flow matching to learn a linear trajectory from noise to data, enabling efficient one-step sampling. By leveraging environmental semantics, our method mitigates the ill-posed nature of estimation with sparse pilots. Extensive experiments demonstrate that MultiCE-Flow consistently outperforms traditional baselines and existing generative models. Notably, it exhibits superior robustness to out-of-distribution scenarios and varying pilot densities, making it suitable for environment-aware communication systems.
format Preprint
id arxiv_https___arxiv_org_abs_2603_13440
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching
Fan, Xiaotian
Zhou, Xingyu
Liang, Le
Li, Xiao
Jin, Shi
Machine Learning
Information Theory
Deep generative models offer a powerful alternative to conventional channel estimation by learning complex channel distributions. By integrating the rich environmental information available in modern sensing-aided networks, this paper proposes MultiCE-Flow, a multimodal channel estimation framework based on flow matching and diffusion transformer (DiT). We design a specialized multimodal perception module that fuses LiDAR, camera, and location data into a semantic condition, while treating sparse pilots as a structural condition. These conditions guide a DiT backbone to reconstruct high-fidelity channels. Unlike standard diffusion models, we employ flow matching to learn a linear trajectory from noise to data, enabling efficient one-step sampling. By leveraging environmental semantics, our method mitigates the ill-posed nature of estimation with sparse pilots. Extensive experiments demonstrate that MultiCE-Flow consistently outperforms traditional baselines and existing generative models. Notably, it exhibits superior robustness to out-of-distribution scenarios and varying pilot densities, making it suitable for environment-aware communication systems.
title Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching
topic Machine Learning
Information Theory
url https://arxiv.org/abs/2603.13440