Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cheng, Runze, Sun, Yao, Taha, Ahmad, Liu, Xuesong, Flynn, David, Imran, Muhammad Ali
Format:	Preprint
Published:	2026
Subjects:	Image and Video Processing Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2601.22202
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918531581345792
author	Cheng, Runze Sun, Yao Taha, Ahmad Liu, Xuesong Flynn, David Imran, Muhammad Ali
author_facet	Cheng, Runze Sun, Yao Taha, Ahmad Liu, Xuesong Flynn, David Imran, Muhammad Ali
contents	Semantic communication (SemCom) emerges as a transformative paradigm for traffic-intensive visual data transmission, shifting focus from raw data to meaningful content transmission and relieving the increasing pressure on communication resources. However, to achieve SemCom, challenges are faced in accurate semantic quantization for visual data, robust semantic extraction and reconstruction under diverse tasks and goals, transceiver coordination with effective knowledge utilization, and adaptation to unpredictable wireless communication environments. In this paper, we present a systematic review of SemCom for visual data transmission (SemCom-Vision), wherein an interdisciplinary analysis integrating computer vision (CV) and communication engineering is conducted to provide comprehensive guidelines for the machine learning (ML)-empowered SemCom-Vision design. Specifically, this survey first elucidates the basics and key concepts of SemCom. Then, we introduce a novel classification perspective to categorize existing SemCom-Vision approaches as semantic preservation communication (SPC), semantic expansion communication (SEC), and semantic refinement communication (SRC) based on communication goals interpreted through semantic quantization schemes. Moreover, this survey articulates the ML-based encoder-decoder models and training algorithms for each SemCom-Vision category, followed by knowledge structure and utilization strategies. Finally, we discuss potential SemCom-Vision applications.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_22202
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	A Survey on Semantic Communication for Vision: Categories, Frameworks, Enabling Techniques, and Applications Cheng, Runze Sun, Yao Taha, Ahmad Liu, Xuesong Flynn, David Imran, Muhammad Ali Image and Video Processing Computer Vision and Pattern Recognition Semantic communication (SemCom) emerges as a transformative paradigm for traffic-intensive visual data transmission, shifting focus from raw data to meaningful content transmission and relieving the increasing pressure on communication resources. However, to achieve SemCom, challenges are faced in accurate semantic quantization for visual data, robust semantic extraction and reconstruction under diverse tasks and goals, transceiver coordination with effective knowledge utilization, and adaptation to unpredictable wireless communication environments. In this paper, we present a systematic review of SemCom for visual data transmission (SemCom-Vision), wherein an interdisciplinary analysis integrating computer vision (CV) and communication engineering is conducted to provide comprehensive guidelines for the machine learning (ML)-empowered SemCom-Vision design. Specifically, this survey first elucidates the basics and key concepts of SemCom. Then, we introduce a novel classification perspective to categorize existing SemCom-Vision approaches as semantic preservation communication (SPC), semantic expansion communication (SEC), and semantic refinement communication (SRC) based on communication goals interpreted through semantic quantization schemes. Moreover, this survey articulates the ML-based encoder-decoder models and training algorithms for each SemCom-Vision category, followed by knowledge structure and utilization strategies. Finally, we discuss potential SemCom-Vision applications.
title	A Survey on Semantic Communication for Vision: Categories, Frameworks, Enabling Techniques, and Applications
topic	Image and Video Processing Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2601.22202

Similar Items