Image Super-Resolution using Efficient Striped Window Transformer

doi:10.48550/arXiv.2301.09869

Image Super-Resolution using Efficient Striped Window Transformer

Transformers have achieved remarkable results in single-image super-resolution (SR). However, the challenge of balancing model performance and complexity has hindered their application in lightweight SR (LSR). To tackle this challenge, we propose an efficient striped window transformer (ESWT). We revisit the normalization layer in the transformer and design a concise and efficient transformer structure to build the ESWT. Furthermore, we introduce a striped window mechanism to model long-term dependencies more efficiently. To fully exploit the potential of the ESWT, we propose a novel flexible window training strategy that can improve the performance of the ESWT without additional cost. Extensive experiments show that ESWT outperforms state-of-the-art LSR transformers, and achieves a better trade-off between model performance and complexity. The ESWT requires fewer parameters, incurs faster inference, smaller FLOPs, and less memory consumption, making it a promising solution for LSR.

Publication:

arXiv e-prints

Pub Date:

January 2023

DOI:

10.48550/arXiv.2301.09869

arXiv:

arXiv:2301.09869

Bibcode:

2023arXiv230109869S

Keywords:

Computer Science - Computer Vision and Pattern Recognition

E-Print:

SOTA lightweight super-resolution transformer. 8 pages, 9 figures and 6 tables. The Code is available at https://github.com/Fried-Rice-Lab/FriedRiceLab

NASA/ADS

Image Super-Resolution using Efficient Striped Window Transformer

Abstract