4.6 Article

A maximum-entropy-attention-based convolutional neural network for image perception

期刊

NEURAL COMPUTING & APPLICATIONS
卷 35, 期 12, 页码 8647-8655

出版社

SPRINGER LONDON LTD
DOI: 10.1007/s00521-022-07564-z

关键词

Machine learning; Image enhancement; Image processing; Feature extraction; Hybrid intelligence

向作者/读者索取更多资源

This paper proposes a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework that utilizes a maximum entropy algorithm for image feature pre-extraction and an attention mechanism to enhance key areas of an image. Experimental results show that the proposed framework achieves high testing accuracy in tasks such as traffic sign recognition and road surface condition monitoring, and the extracted features are more easily interpretable.
In recent years, image perception such as enhancement, classification and object detection with deep learning has achieved significant successes. However, in real world under extreme conditions, the training of a deep learning model often yields low accuracy, low efficiency in feature extraction and generalizability, due to the inner uncourteous and uninterpretable characteristics. In this paper, a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then proposed by combining the extracted features on original images. By applying the mechanism, the key areas of an image are enhanced, and noised area can be ignored. Afterward, the processed images are transferred into region convolutional neural network, which is a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments on traffic sign recognition and road surface condition monitoring are designed. The results show that the proposed framework has high testing accuracy, with improvements of 17% and 2.9%, compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据