Journal
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022)
Volume -, Issue -, Pages 5718-5729Publisher
IEEE COMPUTER SOC
DOI: 10.1109/CVPR52688.2022.00564
Keywords
-
Funding
- NSF CAREER grant [1149783]
- ARC DECRA Fellowship [DE200101100]
- Australian Research Council [DE200101100] Funding Source: Australian Research Council
Ask authors/readers for more resources
Convolutional neural networks (CNNs) perform well at learning image priors, while Transformers capture long-range pixel interactions. However, the computational complexity of Transformers makes it challenging to apply them to high-resolution image restoration tasks. This work proposes an efficient Transformer model, Restormer, which achieves state-of-the-art results on various image restoration tasks.
Since convolutional neural networks (CNNs) perform well at learning generalizable image priors from largescale data, these models have been extensively applied to image restoration and related tasks. Recently, another class of neural architectures, Transformers, have shown significant performance gains on natural language and high-level vision tasks. While the Transformer model mitigates the shortcomings of CNNs (i.e., limited receptive field and inadaptability to input content), its computational complexity grows quadratically with the spatial resolution, therefore making it infeasible to apply to most image restoration tasks involving high-resolution images. In this work, we propose an efficient Transformer model by making several key designs in the building blocks (multi-head attention and feed-forward network) such that it can capture long-range pixel interactions, while still remaining applicable to large images. Our model, named Restoration Transformer (Restormer), achieves state-of-the-art results on several image restoration tasks, including image deraining, single-image motion deblurring, defocus deblurring (single-image and dual-pixel data), and image denoising (Gaussian grayscale/color denoising, and real image denoising). The source code and pre-trained models are available at https://github.com/swz30/Restormer.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available