Proceedings Paper

Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks

Journal

INTERSPEECH 2022
Pages 5343-5347

Publisher

ISCA-INT SPEECH COMMUNICATION ASSOC
DOI: 10.21437/Interspeech.2022-509

Keywords

speech separation; domain mismatch

Speech separation now performs very well when two speakers overlap completely, so research has shifted toward more realistic scenarios. However, domain mismatch between training and test conditions, caused by factors such as speaker, content, channel, and environment, remains a significant problem. This study investigates the impacts of language and channel mismatch on speech separation and proposes a new solution for channel mismatch based on evaluating projection.
Because speech separation performs very well on speech in which two speakers completely overlap, research attention has shifted to more realistic scenarios. However, domain mismatch between training and test conditions, due to factors such as speaker, content, channel, and environment, remains a severe problem for speech separation. Speaker and environment mismatches have been studied in the existing literature, but there are few studies on speech content and channel mismatches, and in those studies the impacts of language and channel are mostly entangled. In this study, we create several datasets for various experiments. The results show that the impact of language differences is small enough to be ignored compared with the impact of channel differences. In our experiments, training on data recorded by Android phones leads to the best generalizability. Moreover, we provide a new solution for channel mismatch by evaluating projection, in which channel similarity can be measured and used to effectively select additional training data to improve performance on in-the-wild test data.
