期刊
PATTERN RECOGNITION
卷 129, 期 -, 页码 -出版社
ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2022.108715
关键词
Gaussian mixture model (GMM); Acoustic transfer function (ATF); Talker localization; Gaussian mixture model (GMM); Acoustic transfer function (ATF); Talker localization
资金
- JSPS KAKENHI [17H01995, 19H00597]
- National Natural Science Founda-tion of China [62176227, U20 6 6213, 6186020 6004]
- Grants-in-Aid for Scientific Research [19H00597, 17H01995] Funding Source: KAKEN
This paper presents an effective method for multi-talker localization using only a single microphone in a room, which can successfully and accurately process the localization task. Experiments demonstrate the effectiveness of the proposed method.
This paper presents an effective method for multiple talker localisation using only a single microphone in a room. One of the main challenge here is obtaining a model that can be used for estimating the localization parameter. This model must be sensitive to all possible speaker locations and correctly dis-criminate their positions. The reverberant speech signal in a room environment can be composited by the clean speech and the acoustic transfer function (ATF). The ATF is a useful tool to describe changes of the speech source, and the approaches based on ATF can thus be used to identify talker localizations with a single microphone. This paper presents two methods, referred to as Composite Reverberant Speech (CRS) model and Direct Training Reverberant Speech (DTRS) model, and uses these methods for obtaining the ATF of a room. The approaches based on proposed methods can successfully and accurately process multi-talker localization task with single microphone. Experiments also demonstrate the effectiveness of the proposed methods.(c) 2022 Elsevier Ltd. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据