Abstract:
This paper investigates the use of mel-frequency deltaphase
(MFDP) features in comparison to, and in fusion with,
traditional mel-frequency cepstral coefficient (MFCC) features
within joint factor analysis (JFA) speaker verification. MFCC
features, commonly used in speaker recognition systems, are
derived purely from the magnitude spectrum, with the phase
spectrum completely discarded. In this paper, we investigate if
features derived from the phase spectrum can provide additional
speaker discriminant information to the traditional MFCC approach
in a JFA based speaker verification system. Results are
presented which provide a comparison of MFCC-only, MFDPonly
and score fusion of the two approaches within a JFA
speaker verification approach. Based upon the results presented
using the NIST 2008 Speaker Recognition Evaluation (SRE)
dataset, we believe that, while MFDP features alone cannot
compete with MFCC features, MFDP can provide complementary
information that result in improved speaker verification performance
when both approaches are combined in score fusion,
particularly in the case of shorter utterances.