DSpace Repository

Evaluating Deep Neural Network-based Speaker Verification Systems on Sinhala and Tamil Datasets

Show simple item record

dc.contributor.author Anuraj, S.P.D.
dc.contributor.author Jarashanth, S.T.
dc.contributor.author Ahilan, K.
dc.contributor.author Valluvan, R.
dc.contributor.author Thiruvaran, T.
dc.contributor.author Kaneswaran, A.
dc.date.accessioned 2023-02-17T07:02:08Z
dc.date.available 2023-02-17T07:02:08Z
dc.date.issued 2022
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/9176
dc.description.abstract Speaker verification, a biometric identifier, determines whether an input speech belongs to the claimed identity. The existing models for speaker verification have reported performances mainly in English, and no study has experimented with Sinhala and Tamil datasets. This study proposes a semi-automated pipeline to curate datasets for Sinhala and Tamil from videos on YouTube filmed under noisy and unconstrained conditions which represent real-world scenarios. Both Sinhala and Tamil datasets include utterances for 140 persons of interest (POIs) with more than 300 utterances per POI under one or more genres: interviews, speeches, and vlogs. Moreover, this study investigates how domain mismatch affects a speaker verification model trained in English and applied to Sinhala and Tamil. Two deep neural network models trained in English show significant performance drops on Sinhala and Tamil datasets compared to an English dataset as expected due to domain mismatch, however, it is observed that AM-softmax performed better than vanilla softmax. In the future, robust speaker verification models with domain adaptation techniques will be built to improve performance on Sinhala and Tamil datasets. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.subject Speaker Verification en_US
dc.subject Sinhala en_US
dc.subject Tamil en_US
dc.subject Dataset en_US
dc.subject ResNet en_US
dc.subject Deep neural networks en_US
dc.title Evaluating Deep Neural Network-based Speaker Verification Systems on Sinhala and Tamil Datasets en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record