4.6 Article

A maximum-entropy-attention-based convolutional neural network for image perception

Journal

NEURAL COMPUTING & APPLICATIONS
Volume 35, Issue 12, Pages 8647-8655

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s00521-022-07564-z

Keywords

Machine learning; Image enhancement; Image processing; Feature extraction; Hybrid intelligence

Ask authors/readers for more resources

This paper proposes a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework that utilizes a maximum entropy algorithm for image feature pre-extraction and an attention mechanism to enhance key areas of an image. Experimental results show that the proposed framework achieves high testing accuracy in tasks such as traffic sign recognition and road surface condition monitoring, and the extracted features are more easily interpretable.
In recent years, image perception such as enhancement, classification and object detection with deep learning has achieved significant successes. However, in real world under extreme conditions, the training of a deep learning model often yields low accuracy, low efficiency in feature extraction and generalizability, due to the inner uncourteous and uninterpretable characteristics. In this paper, a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then proposed by combining the extracted features on original images. By applying the mechanism, the key areas of an image are enhanced, and noised area can be ignored. Afterward, the processed images are transferred into region convolutional neural network, which is a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments on traffic sign recognition and road surface condition monitoring are designed. The results show that the proposed framework has high testing accuracy, with improvements of 17% and 2.9%, compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available