4.6 Article

Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications

期刊

SENSORS
卷 21, 期 18, 页码 -

出版社

MDPI
DOI: 10.3390/s21186231

关键词

speaker recognition; biometrics; e-commerce applications; identity management systems

资金

  1. Ministry of Education, Science and Technological Development of the Republic of Serbia [TR-32013]

向作者/读者索取更多资源

This paper focuses on user identity verification based on voice recognition techniques in e-commerce applications, evaluating the performance of open-source speaker recognition technologies. The results show that i-vectors and solutions based on probabilistic linear discriminant analysis (PLDA) remain dominant in speaker recognition for text-independent tasks.
Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system's resources. The performance of these tasks is directly dependent on the certainty of establishing the identity of the user. The main research focus of this paper is user identity verification approach based on voice recognition techniques. The paper presents research results connected to the usage of open-source speaker recognition technologies in e-commerce applications with an emphasis on evaluating the performance of the algorithms they use. Four open-source speaker recognition solutions (SPEAR, MARF, ALIZE, and HTK) have been evaluated in cases of mismatched conditions during training and recognition phases. In practice, mismatched conditions are influenced by various lengths of spoken sentences, different types of recording devices, and the usage of different languages in training and recognition phases. All tests conducted in this research were performed in laboratory conditions using the specially designed framework for multimodal biometrics. The obtained results show consistency with the findings of recent research which proves that i-vectors and solutions based on probabilistic linear discriminant analysis (PLDA) continue to be the dominant speaker recognition approaches for text-independent tasks.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据