Investigating the use of multiple languages for crisp and fuzzy speaker identification

DA COSTA ABREU, Marjory and AGUIAR DE LIMA, T. (2021). Investigating the use of multiple languages for crisp and fuzzy speaker identification. In: 11th International Conference of Pattern Recognition Systems (ICPRS 2021). IET. [Book Section]

Documents

PDF
multiling_speaker_id_icprs.pdf - Accepted Version
Available under License All rights reserved.

Abstract

The use of speech for system identification is an important and relevant topic. There are several ways of doing it, but most depend on the language the user speaks. However, an all-inclusive and reliable system that takes speech as its input must account for the fact that people can and will speak different languages and have different accents. This research therefore evaluates speaker identification systems in a multilingual setup. Our experiments use three widely spoken languages: Portuguese, English, and Chinese. Initial tests indicate that the systems have a certain degree of robustness across multiple languages. Accuracy decreases as more languages are added, but our investigation suggests that this effect is related to the number of classes.

More Information

Official URL:

https://ieeexplore.ieee.org/document/9569005

Additional Information:

© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Event Location:

Curico, Chile

ISBN:

9781839534300

Identifiers

Identification Number:

10.1049/icp.2021.1431

ORCID for Marjory Da Costa Abreu: