Article

Direction of arrival estimation for indoor environments based on acoustic composition model with a single microphone

Related references

Note: Only some of the references are listed here; download the original article for the complete reference information.

Article Computer Science, Artificial Intelligence

Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition

Aswin Shanmugam Subramanian et al.

Summary: This paper proposes a novel supervised learning method using deep neural networks for multi-source localization in multi-talker conversation analysis. The method uses a source splitting mechanism to estimate the direction of arrival (DOA) of all speakers simultaneously from the audio mixture. It outperforms existing deep learning methods by performing utterance-level prediction and incorporating temporal selection and averaging inside the network. Experimental results demonstrate the effectiveness of a variant of the earth mover's distance (EMD) in classifying DOA at a high resolution. Moreover, the estimated DOAs are used as additional input features in a speech recognition baseline, significantly improving recognition performance.

COMPUTER SPEECH AND LANGUAGE (2022)

Article Chemistry, Multidisciplinary

Fast Sound Source Localization Based on SRP-PHAT Using Density Peaks Clustering

De-Bing Zhuo et al.

Summary: This article proposes an improved sound source localization method, ODB-SRP-PHAT, which determines possible sound source positions through density peaks clustering before real-time localization and stores them in an online database. This significantly reduces the computational load while maintaining high localization accuracy.

APPLIED SCIENCES-BASEL (2021)

Article Computer Science, Artificial Intelligence

Multimodal fusion for indoor sound source localization

Jinhui Chen et al.

Summary: This paper presents a novel solution based on fusing visual and acoustic models for indoor sound source localization. Two new approaches allow the direction and distance of the sound source to be estimated stably, which improves the verification task of sound source localization.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Detection of COVID-19 from speech signal using bio-inspired based cepstral features

Tusar Kanti Dash et al.

Summary: Early detection of COVID-19 is challenging because the disease spreads readily and causes fear among people. Speech-based detection, focusing on coughing sounds, can improve efficiency. This study develops a new feature, the COVID-19 Coefficient (C-19CC), to improve detection performance.

PATTERN RECOGNITION (2021)

Article Acoustics

Multiple Source Direction of Arrival Estimations Using Relative Sound Pressure Based MUSIC

Yonggang Hu et al.

Summary: The paper introduces a novel MUSIC algorithm designed for noisy environments using higher-order microphone array measurements. Decomposing the signal into the spherical harmonics domain and applying a frequency smoothing technique improve localization accuracy, with the added capability of estimating the number of active sound sources. Experimental results show advantages over traditional MUSIC and another recent multi-source localization method.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Engineering, Multidisciplinary

A TDOA-based multiple source localization using delay density maps

Ritu Boora et al.

SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES (2020)

Article Computer Science, Artificial Intelligence

Joint learning for voice based disease detection

Kebin Wu et al.

PATTERN RECOGNITION (2019)

Article Computer Science, Artificial Intelligence

Blind separation of temporally correlated noncircular sources using complex matrix joint diagonalization

Jiong Li et al.

PATTERN RECOGNITION (2019)

Article Computer Science, Artificial Intelligence

Real-Time monophonic and polyphonic audio classification from power spectra

Maxime Baelde et al.

PATTERN RECOGNITION (2019)

Article Acoustics

TDOA-Based Multiple Acoustic Source Localization Without Association Ambiguity

Harshavardhan Sundar et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2018)

Article Acoustics

Swarm Intelligence Based Particle Filter for Alternating Talker Localization and Tracking Using Microphone Arrays

Kai Wu et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2017)

Article Engineering, Electrical & Electronic

A Steered-Response Power Algorithm Employing Hierarchical Search for Acoustic Source Localization Using Microphone Arrays

Leonardo O. Nunes et al.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2014)

Article Acoustics

A Geometric Approach to Sound Source Localization from Time-Delay Estimates

Xavier Alameda-Pineda et al.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2014)

Article Acoustics

Independent Component Analysis Using Spherical Microphone Arrays

Nicolas Epain et al.

ACTA ACUSTICA UNITED WITH ACUSTICA (2012)

Article Acoustics

Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources

Francesco Nesta et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2012)

Article Acoustics

Localization of distinct reflections in rooms using spherical microphone array eigenbeam processing

Haohai Sun et al.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2012)

Article Engineering, Electrical & Electronic

Source localization for multiple speech sources using low complexity non-parametric source separation and clustering

M. Swartling et al.

SIGNAL PROCESSING (2011)

Article Acoustics

A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources

Wenyi Zhang et al.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2010)

Article Engineering, Electrical & Electronic

Single-Channel Talker Localization Based on Discrimination of Acoustic Transfer Functions

Tetsuya Takiguchi et al.

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING (2009)

Article Computer Science, Information Systems

Acoustic model adaptation using first-order linear prediction for reverberant speech

T Takiguchi et al.

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS (2006)

Article Engineering, Electrical & Electronic

Time delay estimation in room acoustic environments: An overview

Jingdong Chen et al.

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING (2006)

Article Acoustics

Time-delay estimation via linear interpolation and cross correlation

J Benesty et al.

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING (2004)

Article Acoustics

HMM-separation-based speech recognition for a distant moving speaker

T Takiguchi et al.

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING (2001)