DA COSTA ABREU, Marjory and AGUIAR DE LIMA, T. (2021). Investigating the use of multiple languages for crisp and fuzzy speaker identification. In: 11th International Conference of Pattern Recognition Systems (ICPRS 2021). IET.
|
PDF
multiling_speaker_id_icprs.pdf - Accepted Version All rights reserved. Download (741kB) | Preview |
Abstract
The use of speech for system identification is an important and relevant topic. There are several ways of doing it, but most are dependent on the language the user speaks. However, if the idea is to create an all-inclusive and reliable system that uses speech as its input, we must take into account that people can and will speak different languages and have different accents. Thus, this research evaluates speaker identification systems on a multilingual setup. Our experiments are performed using three widely spoken languages which are Portuguese, English, and Chinese. Initial tests indicated the systems have certain robustness on multiple languages. Results with more languages decreases our accuracy, but our investigation suggests these impacts are related to the number of classes.
Item Type: | Book Section |
---|---|
Additional Information: | © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Identification Number: | https://doi.org/10.1049/icp.2021.1431 |
SWORD Depositor: | Symplectic Elements |
Depositing User: | Symplectic Elements |
Date Deposited: | 22 Jan 2021 16:22 |
Last Modified: | 21 Jan 2022 13:00 |
URI: | https://shura.shu.ac.uk/id/eprint/28009 |
Actions (login required)
View Item |
Downloads
Downloads per month over past year