4.7 Article

Underwater target recognition using convolutional recurrent neural networks with 3-D Mel-spectrogram and data augmentation

Journal

APPLIED ACOUSTICS
Volume 178, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2021.107989

Keywords

Underwater acoustic target recognition; Feature extraction; Mel-spectrogram; Data augmentation; Convolutional Recurrent Neural Networks

Categories

Funding

  1. National Natural Science Foundation of China [41906169]

Ask authors/readers for more resources

Passive recognition of underwater acoustic targets is a hot research issue, and combining deep learning and time-frequency spectrogram for feature extraction, as well as utilizing data augmentation and deep neural network, can effectively enhance the recognition accuracy. Experimental results on the ShipsEar dataset show that the proposed method achieves high recognition performance.
Passive recognition of underwater acoustic targets is a hot research issue in acoustic signal processing. The long-term interference of irregular noise in the marine environment caused the relevance of the passive recognition method of underwater targets based on the traditional technical framework to gradually decrease. Due to the interference of irregular noise in the ocean, the passive recognition method used for underwater targets based on the traditional technical framework is gradually becoming less relevant. The feature extraction method that combines deep learning and time-frequency spectrogram can better describe the differences of different targets. In this paper, the proposed model contains three steps to deal with the recognition of underwater targets: feature extraction, data augmentation and deep neural network. For the feature extraction, we use a Mel-spectrogram, as well as the delta and delta-delta features in order to construct 3-D features. In the data augmentation part, we expand the dataset with SpecAugment in the time domain and frequency domain. In deep neural network prediction part, we use the convolutional recurrent neural network (CRNN) for acoustic target recognition. Through a comparison with the ablation test, it is clear that the pipeline in our method is effective in acquiring the recognition result. After evaluating our system through the carrying out of three tasks on the ShipsEar dataset, and the recognition accuracy are 94.6%, 87.5% and 72.6% in task 1, task 2 and task 3 respectively. (C) 2021 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available