DSpace Repository

A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems

Show simple item record

dc.contributor.author Ahilan, K.
dc.contributor.author Dean, D.
dc.contributor.author Sridharan, S.
dc.contributor.author Ghaemmaghami, H.
dc.contributor.author Fookes, C.
dc.date.accessioned 2021-03-16T05:14:16Z
dc.date.accessioned 2022-06-27T10:02:27Z
dc.date.available 2021-03-16T05:14:16Z
dc.date.available 2022-06-27T10:02:27Z
dc.date.issued 2017
dc.identifier.citation Kanagasundaram, A., Dean, D., Sridharan, S., Ghaemmaghami, H., & Fookes, C. (2017). A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems. International Journal of Speech Technology, 20(2), 247-259. en_US
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1913
dc.description.abstract This paper studies the performance degradation of Gaussian probabilistic linear discriminant analysis (GPLDA) speaker verification system, when only short-utterance data is used for speaker verification system development. Subsequently, a number of techniques, including utterance partitioning and source-normalised weighted linear discriminant analysis (SN-WLDA) projections are introduced to improve the speaker verification performance in such conditions. Experimental studies have found that when short utterance data is available for speaker verification development, GPLDA system overall achieves best performance with a lower number of universal background model (UBM) components. As a lower number of UBM components significantly reduces the computational complexity of speaker verification system, that is a useful observation. In limited session data conditions, we propose a simple utterance-partitioning technique, which when applied to the LDA-projected GPLDA system shows over 8% relative improvement on EER values over baseline system on NIST 2008 truncated 10–10 s conditions. We conjecture that this improvement arises from the apparent increase in the number of sessions arising from our partitioning technique and this helps to better model the GPLDA parameters. Further, partitioning SN-WLDA-projected GPLDA shows over 16% and 6% relative improvement on EER values over LDA-projected GPLDA systems respectively on NIST 2008 truncated 10–10 s interviewinterview, and NIST 2010 truncated 10–10 s interviewinterview and telephone-telephone conditions. en_US
dc.language.iso en en_US
dc.publisher Springer en_US
dc.title A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record