Abstract:
In automatic forensic speaker recognition research, two
frameworks are used: single-stage and two-stage modelling.
Although each has its own strengths and limitations,
performance is an important attribute to consider when
choosing between them for forensic research.
This paper compares a calibrated single-stage system with a
calibrated two-stage system on a common database, using several
performance metrics. Neither system provides a
consistent advantage over the other across all performance
measures, raising the question of whether the use of a two-stage
system, which requires additional data and effort, is warranted.