This study sets out to find the most reliable method for loglikelihood-ratio (LLR) calculation under severe data scarcity, which is typical of forensic voice comparison casework. We compared the performances of three types of speaker modelling, namely a single Gaussian model, Gaussian Mixture Models (GMM) of different complexity, and a Multivariate Kernel Density Model (MVKD), using two and threedimensional formant frequency feature vectors extracted from /iː/ vowels. We varied the number of tokens used in the offender dataset from 2 to 6. We find that calibration of the systems was critical for dependable evaluation with all the systems tested and that the MVKD model outperformed Gaussian models in most cases.
|Publication status||Published - 2014|
|Event||Annual Conference of the International Speech Communication Association INTERSPEECH 2014 - Singapore|
Duration: 1 Jan 2014 → …
|Conference||Annual Conference of the International Speech Communication Association INTERSPEECH 2014|
|Period||1/01/14 → …|