☆ 4.6 Article

A maximum-entropy-attention-based convolutional neural network for image perception

NEURAL COMPUTING & APPLICATIONS (2023)

期刊

NEURAL COMPUTING & APPLICATIONS

卷 35, 期 12, 页码 8647-8655

出版社

SPRINGER LONDON LTD

DOI: 10.1007/s00521-022-07564-z

关键词

Machine learning; Image enhancement; Image processing; Feature extraction; Hybrid intelligence

类别

Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework that utilizes a maximum entropy algorithm for image feature pre-extraction and an attention mechanism to enhance key areas of an image. Experimental results show that the proposed framework achieves high testing accuracy in tasks such as traffic sign recognition and road surface condition monitoring, and the extracted features are more easily interpretable.

In recent years, image perception such as enhancement, classification and object detection with deep learning has achieved significant successes. However, in real world under extreme conditions, the training of a deep learning model often yields low accuracy, low efficiency in feature extraction and generalizability, due to the inner uncourteous and uninterpretable characteristics. In this paper, a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then proposed by combining the extracted features on original images. By applying the mechanism, the key areas of an image are enhanced, and noised area can be ignored. Afterward, the processed images are transferred into region convolutional neural network, which is a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments on traffic sign recognition and road surface condition monitoring are designed. The results show that the proposed framework has high testing accuracy, with improvements of 17% and 2.9%, compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable.

A maximum-entropy-attention-based convolutional neural network for image perception

期刊

NEURAL COMPUTING & APPLICATIONS

出版社

SPRINGER LONDON LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A maximum-entropy-attention-based convolutional neural network for image perception

期刊

NEURAL COMPUTING & APPLICATIONS

出版社

SPRINGER LONDON LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文