Underwater target recognition using convolutional recurrent neural networks with 3-D Mel-spectrogram and data augmentation

APPLIED ACOUSTICS (2021)

Journal

APPLIED ACOUSTICS

Volume 178, Issue -, Pages -

Publisher

ELSEVIER SCI LTD

DOI: 10.1016/j.apacoust.2021.107989

Keywords

Underwater acoustic target recognition; Feature extraction; Mel-spectrogram; Data augmentation; Convolutional Recurrent Neural Networks

Funding

National Natural Science Foundation of China [41906169]

Automated Summary

Passive recognition of underwater acoustic targets is an active research topic. Combining time-frequency spectrogram features with deep learning, together with data augmentation and a deep neural network, can improve recognition accuracy. Experiments on the ShipsEar dataset show that the proposed method reaches recognition accuracies of 94.6%, 87.5% and 72.6% on three tasks.

Abstract

Passive recognition of underwater acoustic targets is an active research topic in acoustic signal processing. Because of long-term interference from irregular noise in the marine environment, passive recognition methods for underwater targets built on the traditional technical framework are gradually becoming less relevant. Feature extraction that combines deep learning with time-frequency spectrograms can better describe the differences between targets. In this paper, the proposed model handles the recognition of underwater targets in three steps: feature extraction, data augmentation and deep neural network prediction. For feature extraction, we use a Mel-spectrogram together with its delta and delta-delta features to construct 3-D features. For data augmentation, we expand the dataset with SpecAugment in the time domain and frequency domain. For prediction, we use a convolutional recurrent neural network (CRNN) for acoustic target recognition. A comparison with an ablation test shows that the pipeline in our method is effective in obtaining the recognition result. We evaluate our system on three tasks on the ShipsEar dataset; the recognition accuracies are 94.6%, 87.5% and 72.6% on task 1, task 2 and task 3, respectively. © 2021 Elsevier Ltd. All rights reserved.
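The first two steps named in the abstract (3-D feature construction and spectrogram masking) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a log-Mel spectrogram is already available as an array of shape `(n_mels, n_frames)`, approximates the delta features with `np.gradient` rather than the regression-based deltas common in audio toolkits, and applies only one frequency mask and one time mask (no time warping). The function names `stack_deltas` and `spec_augment` and all parameter values are hypothetical.

```python
import numpy as np


def stack_deltas(log_mel: np.ndarray) -> np.ndarray:
    """Stack a (n_mels, n_frames) spectrogram with its first and second
    time differences into a (3, n_mels, n_frames) feature array."""
    # Assumption: central differences stand in for the paper's delta features.
    delta = np.gradient(log_mel, axis=1)
    delta2 = np.gradient(delta, axis=1)
    return np.stack([log_mel, delta, delta2], axis=0)


def spec_augment(features: np.ndarray, max_freq_width: int,
                 max_time_width: int, rng: np.random.Generator) -> np.ndarray:
    """Return a copy with one random frequency band and one random time
    span set to zero across all three channels."""
    out = features.copy()
    _, n_mels, n_frames = out.shape

    # Frequency mask: width f in [0, max_freq_width], start chosen to fit.
    f = int(rng.integers(0, max_freq_width + 1))
    f0 = int(rng.integers(0, n_mels - f + 1))
    out[:, f0:f0 + f, :] = 0.0

    # Time mask: width t in [0, max_time_width], start chosen to fit.
    t = int(rng.integers(0, max_time_width + 1))
    t0 = int(rng.integers(0, n_frames - t + 1))
    out[:, :, t0:t0 + t] = 0.0
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    log_mel = rng.standard_normal((64, 128))  # placeholder for a real log-Mel spectrogram
    features = stack_deltas(log_mel)
    augmented = spec_augment(features, max_freq_width=8, max_time_width=16, rng=rng)
    print(features.shape, augmented.shape)
```

Each call to `spec_augment` yields a differently masked copy of the same recording, which is how masking can expand a training set. The resulting three-channel array would then be passed to the CRNN classifier, whose architecture is not specified in the abstract and is therefore not sketched here.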
