Investigating the use of multiple languages for crisp and fuzzy speaker identification

DA COSTA ABREU, Marjory and AGUIAR DE LIMA, T. (2021). Investigating the use of multiple languages for crisp and fuzzy speaker identification. In: 11th International Conference of Pattern Recognition Systems (ICPRS 2021). IET. [Book Section]

Documents
28009:566078
[thumbnail of multiling_speaker_id_icprs.pdf]
Preview
PDF
multiling_speaker_id_icprs.pdf - Accepted Version
Available under License All rights reserved.

Download (741kB) | Preview
Abstract
The use of speech for system identification is an important and relevant topic. There are several ways of doing it, but most are dependent on the language the user speaks. However, if the idea is to create an all-inclusive and reliable system that uses speech as its input, we must take into account that people can and will speak different languages and have different accents. Thus, this research evaluates speaker identification systems on a multilingual setup. Our experiments are performed using three widely spoken languages which are Portuguese, English, and Chinese. Initial tests indicated the systems have certain robustness on multiple languages. Results with more languages decreases our accuracy, but our investigation suggests these impacts are related to the number of classes.
More Information
Statistics

Downloads

Downloads per month over past year

View more statistics

Metrics

Altmetric Badge

Dimensions Badge

Share
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Actions (login required)

View Item View Item