4.0 Article

Plug-and-play microphones for recording speech and voice with smart devices

Journal

FOLIA PHONIATRICA ET LOGOPAEDICA
Volume -, Issue -, Pages -

Publisher

KARGER
DOI: 10.1159/000535152

Keywords

Acoustic analysis; voice; speech; remote assessment; microphone

Ask authors/readers for more resources

Smart devices offer great potential for scalable voice assessment, but there is wide heterogeneity among acoustic metrics for their robustness to variation in recording devices. Consumer-grade plug-and-play microphones show biases towards high frequencies and noise when compared to reference microphones, and relative noise is closely related to microphone-to-mouth distance. While some acoustic metrics show good agreement between consumer-grade and reference microphones, others, particularly fine timing metrics, show poor agreement. Measurement of f0 and F2 slope are robust to variation in recording equipment. Plug-and-play microphones may be used for equipment standardization in speech studies, with some loss in accuracy and statistical power.
INTRODUCTION Smart devices are widely available and capable of quickly recording and uploading speech segments for health-related analysis. The switch from laboratory recordings with professional-grade microphone set ups to remote, smart device-based recordings offers immense potential for the scalability of voice assessment. Yet, a growing body of literature points to a wide heterogeneity among acoustic metrics for their robustness to variation in recording devices. The addition of consumer-grade plug-and-play microphones has been proposed as a possible solution. Our aim was to assess if the addition of consumer-grade plug-and-play microphones increase the acoustic measurement agreement between ultra-portable devices and a reference microphone. METHODS Speech was simultaneously recorded by a reference high-quality microphone commonly used in research, and by two configurations with plug-and-play microphones. Twelve speech-acoustic features were calculated using recordings from each microphone to determine the agreement intervals in measurements between microphones. Agreement intervals were then compared to expected deviations in speech in various neurological conditions. Each microphone's response to speech and to silence were characterized through acoustic analysis to explore possible reasons for differences in acoustic measurements between microphones. The statistical differentiation of two groups, neurotypical and people with Multiple Sclerosis, using metrics from each tested microphone was compared to that of the reference microphone. RESULTS The two consumer-grade plug-and-play microphones favoured high frequencies (mean centre of gravity difference >= +175.3Hz) and recorded more noise (mean difference in signal-to-noise <= -4.2dB) when compared to the reference microphone. Between consumer-grade microphones, differences in relative noise were closely related to distance between the microphone and the speaker's mouth. Agreement intervals between the reference and consumer-grade microphones remained under disease-expected deviations only for fundamental frequency (f0, agreement interval <= 0.06Hz), f0 instability (f0 CoV, agreement interval <= 0.05%) and for tracking of second formant movement (agreement interval <= 1.4Hz/millisecond). Agreement between microphones was poor for other metrics, particularly for fine timing metrics (mean pause length and pause length variability for various tasks). The statistical difference between the two groups of speakers was smaller with the plug-and-play than with the reference microphone. CONCLUSION Measurement of f0 and F2 slope were robust to variation in recording equipment while other acoustic metrics were not. Thus, the tested plug-and-play microphones should not be used interchangeably with professional-grade microphones for speech analysis. Plug-and-play microphones may assist in equipment standardization within speech studies, including remote or self-recording, possibly with small loss in accuracy and statistical power as observed in this study.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.0
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available