Investigating deep neural networks for speaker diarization in the Dihard challenge

Please use this identifier to cite or link to this item: http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1888

Title:	Investigating deep neural networks for speaker diarization in the Dihard challenge
Authors:	Himawan, I. Rahman, M.H. Sridharan, S. Fookes, C. Ahilan, K.
Keywords:	DIHARD challenge;speaker diarization
Issue Date:	2018
Publisher:	IEEE
Citation:	Himawan, I., Rahman, M. H., Sridharan, S., Fookes, C., & Kanagasundaram, A. (2018, December). Investigating deep neural networks for speaker diarization in the dihard challenge. In 2018 IEEE Spoken Language Technology Workshop (SLT) (pp. 1029-1035). IEEE.
Abstract:	We investigate the use of deep neural networks (DNNs) for the speaker diarization task to improve performance under domain mismatched conditions. Three unsupervised domain adaptation techniques, namely inter-dataset variability compensation (IDVC), domain-invariant covariance normalization (DICN), and domain mismatch modeling (DMM), are applied on DNN based speaker embeddings to compensate for the mismatch in the embedding subspace. We present results conducted on the DIHARD data, which was released for the 2018 diarization challenge. Collected from a diverse set of domains, this data provides very challenging domain mismatched conditions for the diarization task. Our results provide insights into how the performance of our proposed system could be further improved.
URI:	http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/1888
Appears in Collections:	Electrical & Electronic Engineering

Files in This Item:

File	Description	Size	Format
INVESTIGATING DEEP NEURAL NETWORKS FOR SPEAKER DIARIZATION IN THE.pdf		41.01 kB	Adobe PDF	View/Open