DSpace Repository

LEAP Diarization System for the Second DIHARD Challenge

Show simple item record

dc.contributor.author Singh, P.
dc.contributor.author Vardhan, H.
dc.contributor.author Ganapathy, S.
dc.contributor.author Ahilan, K.
dc.date.accessioned 2021-03-15T08:20:12Z
dc.date.accessioned 2022-06-27T10:02:17Z
dc.date.available 2021-03-15T08:20:12Z
dc.date.available 2022-06-27T10:02:17Z
dc.date.issued 2019
dc.identifier.citation Singh, P., Vardhan, H., Ganapathy, S., & Kanagasundaram, A. (2019, January). LEAP diarization system for the second dihard challenge. In Proceedings of the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019): Crossroads of Speech and Language (pp. 983-987). International Speech Communication Association. en_US
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1893
dc.description.abstract This paper presents the LEAP System, developed for the Second DIHARD diarization Challenge. The evaluation data in the challenge is composed of multi-talker speech in restaurants, doctor-patient conversations, child language acquisition recordings in home environments and audio extracted YouTube videos. The LEAP system is developed using two types of embeddings, one based on i-vector representations and the other one based on x-vector representations. The initial diarization output obtained using agglomerative hierarchical clustering (AHC) done on the probabilistic linear discriminant analysis (PLDA) scores is refined using the Variational-Bayes hidden Markov model (VB-HMM) model. We propose a modified VBHMM model with posterior scaling which provides significant improvements in the final diarization error rate (DER). We also use a domain compensation on the i-vector features to reduce the mis-match between training and evaluation conditions. Using the proposed approaches, we obtain relative improvements in DER of about 7:1% relative for the best individual system over the DIHARD baseline system and about 13:7% relative for the final system combination on evaluation set. An analysis performed using the proposed posterior scaling method shows that scaling results in improved discrimination among theHMM states in the VB-HMM. en_US
dc.language.iso en en_US
dc.subject Speaker Diarization en_US
dc.subject i-vector en_US
dc.title LEAP Diarization System for the Second DIHARD Challenge en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record