4.5 Review

Attention mechanisms in computer vision: A survey

Journal

COMPUTATIONAL VISUAL MEDIA
Volume 8, Issue 3, Pages 331-368

Publisher

TSINGHUA UNIV PRESS
DOI: 10.1007/s41095-022-0271-y

Keywords

attention; transformer; computer vision; deep learning; salience

Funding

  1. National Natural Science Foundation of China [61521002, 62132012]

Ask authors/readers for more resources

Attention mechanisms, inspired by the human visual system, have been successfully applied in various computer vision tasks. This survey provides a comprehensive review of different types of attention mechanisms and suggests future research directions.
Humans can naturally and effectively find salient regions in complex scenes. Motivated by this observation, attention mechanisms were introduced into computer vision with the aim of imitating this aspect of the human visual system. Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image. Attention mechanisms have achieved great success in many visual tasks, including image classification, object detection, semantic segmentation, video understanding, image generation, 3D vision, multimodal tasks, and self-supervised learning. In this survey, we provide a comprehensive review of various attention mechanisms in computer vision and categorize them according to approach, such as channel attention, spatial attention, temporal attention, and branch attention; a related repository https://github.com/MenghaoGuo/Awesome-Vision-Attentions is dedicated to collecting related work. We also suggest future directions for attention mechanism research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available