Please use this identifier to cite or link to this item: http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1898
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAhilan, K.
dc.contributor.authorDean, D.
dc.contributor.authorSridharan, S.
dc.contributor.authorLaren, M.M.
dc.contributor.authorVogt, R.
dc.date.accessioned2021-03-16T02:28:40Z
dc.date.accessioned2022-06-27T10:02:19Z-
dc.date.available2021-03-16T02:28:40Z
dc.date.available2022-06-27T10:02:19Z-
dc.date.issued2014
dc.identifier.citationKanagasundaram, A., Dean, D., Sridharan, S., McLaren, M., & Vogt, R. (2014). I-vector based speaker recognition using advanced channel compensation techniques. Computer Speech & Language, 28(1), 121-140.en_US
dc.identifier.urihttp://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1898-
dc.description.abstractThis paper investigates advanced channel compensation techniques for the purpose of improving i-vector speaker verification performance in the presence of high intersession variability using the NIST 2008 and 2010 SRE corpora. The performance of four channel compensation techniques: (a) weighted maximum margin criterion (WMMC), (b) source-normalized WMMC (SN-WMMC), (c) weighted linear discriminant analysis (WLDA) and (d) sourcenormalized WLDA (SN-WLDA) have been investigated. We show that, by extracting the discriminatory information between pairs of speakers as well as capturing the source variation information in the development i-vector space, the SN-WLDA based cosine similarity scoring (CSS) i-vector system is shown to provide over 20% improvement in EER for NIST 2008 interview and microphone verification and over 10% improvement in EER for NIST 2008 telephone verification, when compared to SN-LDA based CSS i-vector system. Further, score-level fusion techniques are analyzed to combine the best channel compensation approaches, to provide over 8% improvement in DCF over the best single approach, (SN-WLDA), for NIST 2008 interview/ telephone enrolment-verification condition. Finally, we demonstrate that the improvements found in the context of CSS also generalize to state-of-the-art GPLDA with up to 14% relative improvement in EER for NIST SRE 2010 interview and microphone verification and over 7% relative improvement in EER for NIST SRE 2010 telephone verification.en_US
dc.language.isoenen_US
dc.subjectSpeaker verificationen_US
dc.subjectI-vectoren_US
dc.titleI-vector based Speaker Recognition Using Advanced Channel Compensation Techniquesen_US
dc.typeArticleen_US
Appears in Collections:Electrical & Electronic Engineering

Files in This Item:
File Description SizeFormat 
I-vector based Speaker Recognition Using Advanced Channel.pdf128.43 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.