4.6 Article

IA-Net$:$ An Inception-Attention-Module-Based Network for Classifying Underwater Images From Others

期刊

IEEE JOURNAL OF OCEANIC ENGINEERING
卷 47, 期 3, 页码 704-717

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JOE.2021.3126090

关键词

Visualization; Kernel; Image recognition; Convolution; Task analysis; Water; Oceans; Attention module; convolutional neural network (CNN); image classification; inception module; underwater image

资金

  1. National Natural Science Foundation of China [61601194]
  2. Natural Science Foundation of Jiangsu province [BK20191469]
  3. Special Foundation for Natural Resources Development of Jiangsu Province [JSZRHYKJ202116]
  4. Postgraduate Research and Innovation Program [DZXS202003, DZXS202004]

向作者/读者索取更多资源

This article reports a convolutional neural network (CNN)-based inception-attention network (IA-Net) model for classifying underwater images from natural images. By simulating the visual correlation mechanism of images taken from special environments, the importance of context background in addition to salient objects is demonstrated. The IA-Net achieves high accuracy in underwater image classification and outperforms other networks in distinguishing underwater images from similar nonunderwater images.
To distinguish underwater images from natural images is one of the challenge of collecting and generation of underwater image data. Common image classification and recognition models classify the objects in an image depending on the saliency while suppressing the background. In this article, an inception-attention network (IA-Net), a convolutional neural network (CNN)-based model to classify the underwater images from natural images is reported, in which an inception-attention (I-A) module is constructed to simulate the visual correlation mechanism of classifying images taken from special environments such as fog, nighttime and under water. It is illustrated that the context background is as important as the salient object when understanding the underwater images. We executed experiments on a data set, which consists of 4000 underwater images and 5000 nonunderwater images, and demonstrate that the proposed IA-Net achieves an accuracy of 99.3$\%$ on underwater image classification, which is significantly better than classical image classification networks, such as AlexNet, InceptionV3, and ResNet. In addition, the comparative experiments prove that the IA-Net is superior to other networks when distinguishing underwater images from foggy, nighttime images and fish images taken in nonunderwater environments, although these images have indistinguishable characteristics with underwater images. Moreover, we demonstrate the I-A structure we proposed can be used to boost the performance of the existing object recognition networks. By substituting the inception module with the I-A module, the Inception-ResNetV2 network achieves a 10.7$\%$ top-1 error rate on the subset of ILSVRC-2012, which further illustrates the effectiveness of the correlation between the image background and subjective perception in improving the performance of the visual analysis tasks.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据