May 5, 2023

WITT: A Wireless Image Transmission Transformer for Semantic Communications

Key Points

Key points are not available for this paper at this time.

Abstract

In this paper, we aim to redesign the vision Transformer (ViT) as a new backbone to realize semantic image transmission, termed wireless image transmission transformer (WITT). Previous works build upon convolutional neural networks (CNNs), which are inefficient in capturing global dependencies, resulting in degraded end-to-end transmission performance especially for high-resolution images. To tackle this, the proposed WITT employs Swin Transformers as a more capable backbone to extract long-range information. Different from ViTs in image classification tasks, WITT is highly optimized for image transmission while considering the effect of the wireless channel. Specifically, we propose a spatial modulation module to scale the latent representations according to channel state information, which enhances the ability of a single model to deal with various channel conditions. As a result, extensive experiments verify that our WITT attains better performance for different image resolutions, distortion metrics, and channel conditions. The code is available at https://github.com/KeYang8/WITT.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Ke Yang

Jiangsu University of Science and Technology

Sixian Wang

Shanghai Jiao Tong University

Jincheng Dai

Qingdao University

Actions

Institutions

Beijing University of Posts and Telecommunications

Peng Cheng Laboratory

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

WITT: A Wireless Image Transmission Transformer for Semantic Communications

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study