Exploring sub-band cepstral distances for more robust speaker classification

Takashi Osanai, Yuko Kinoshita, Frantz Clermont

    Research output: Contribution to conferencePaper

    Abstract

    This paper presents the first of two-part exploration into the potential of parametric cepstral distance (PCD) as a forensic voice comparison feature, based on Japanese vowel data collected from 306 male native speakers under microphone and mobile transmission conditions. The behaviours of PCDs were closely examined by altering sub-band settings, and we found the behaviour of PCDs to correspond well to what is known about formants, which suggests that PCDs are relatable to articulatory gestures. Comparison between sub-band and full-band PCD revealed that limiting the band range to a specific frequency region makes the feature more robust against channel mismatch, encouraging further examination of this potential feature.
    Original languageEnglish
    Pages41-44
    Publication statusPublished - 2018
    Event17th Australasian International Conference on Speech Science and Technology - Sydney, Australia
    Duration: 1 Jan 2018 → …

    Conference

    Conference17th Australasian International Conference on Speech Science and Technology
    Period1/01/18 → …

    Fingerprint Dive into the research topics of 'Exploring sub-band cepstral distances for more robust speaker classification'. Together they form a unique fingerprint.

    Cite this