Please use this identifier to cite or link to this item: http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/9177
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJarashanth, S.T.-
dc.contributor.authorAhilan, K.-
dc.contributor.authorValluvan, R.-
dc.contributor.authorThiruvaran, T.-
dc.contributor.authorKaneswaran, A.-
dc.date.accessioned2023-02-17T07:08:11Z-
dc.date.available2023-02-17T07:08:11Z-
dc.date.issued2022-
dc.identifier.urihttp://repo.lib.jfn.ac.lk/ujrr/handle/123456789/9177-
dc.description.abstractSpeaker diarization is the task of partitioning a speech signal into homogeneous segments corresponding to speaker identities. We introduce a Tamil test dataset, considering that the existing literature on speaker diarization has experimented with English to a great extent; however, none on a Tamil dataset. An overlapped speech segment is a part of an audio clip where two or more speakers speak simultaneously. Overlapped speech regions degrade the performance of a speaker diarization system proportionally due to the complexity of identifying individual speakers. This study proposes an overlapped speech detection (OSD) model by discarding the non-speech segments and feeding speech segments into a Convolutional Recurrent Neural Network model as a binary classifier: single speaker speech and overlapped speech. The OSD model is integrated into a speaker diarizer, and the performance gain on the standard VoxConverse and our Tamil datasets in terms of Diarization Error Rate are 5.6% and 13.4%, respectively.en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectOverlapped speech detectionen_US
dc.subjectSpeaker diarizationen_US
dc.subjectConvolutional recurrent neural networken_US
dc.subjectBinary classifieren_US
dc.subjectTamil dataseten_US
dc.titleOverlapped Speech Detection for Improved Speaker Diarization on Tamil Dataseten_US
dc.typeArticleen_US
Appears in Collections:Electrical & Electronic Engineering

Files in This Item:
File Description SizeFormat 
Overlapped Speech Detection for Improved Speaker.pdf248.51 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.