DSpace Repository

Improving the performance of GPLDA speaker verification using unsupervised inter‑dataset variability compensation approaches

Show simple item record

dc.contributor.author Ahilan, K.
dc.date.accessioned 2021-03-16T02:42:43Z
dc.date.accessioned 2022-06-27T10:02:24Z
dc.date.available 2021-03-16T02:42:43Z
dc.date.available 2022-06-27T10:02:24Z
dc.date.issued 2018
dc.identifier.citation Kanagasundaram, A. (2018). Improving the performance of GPLDA speaker verification using unsupervised inter-dataset variability compensation approaches. International Journal of Speech Technology, 21(3), 533-544. en_US
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1900
dc.description.abstract In practical applications, speaker verification systems have to be developed and trained using data which is outside the domain of the intended application as the collection of significant amount of in-domain data could be difficult. Experimental studies have found that when a GPLDA system is trained using out-domain data, it significantly affects the speaker verification performance due to the mismatch between development data and evaluation data. This paper proposes several unsupervised inter-dataset variability compensation approaches for the purpose of improving the performance of GPLDA systems trained using out-domain data. We show that when GPLDA is trained using out-domain data, we can improve the performance by as much as 39% by using by score normalisation using small amounts of in-domain data. Also in situations where rich out-domain data and only limited in-domain data are available, a pooled-linear-weighted technique to estimate the GPLDA parameters shows 35% relative improvements in equal error rate (EER) on int–int conditions. We also propose a novel inter-dataset covariance normalization (IDCN) approach to overcome in- and out-domain data mismatch problem. Our unsupervised IDCN-compensated GPLDA system shows 14 and 25% improvement respectively in EER over out-domain GPLDA speaker verification on tel–tel and int–int training–testing conditions. We provide intuitive explanations as to why these inter-dataset variability compensation approaches provide improvements to speaker verification accuracy. en_US
dc.language.iso en en_US
dc.publisher Springer en_US
dc.subject Speaker recognition en_US
dc.subject i-Vectors en_US
dc.title Improving the performance of GPLDA speaker verification using unsupervised inter‑dataset variability compensation approaches en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record