Stability of Forensic Text Comparison System

Shunichi Ishihara, Susan Brown

    Research output: Contribution to conference › Paper


    This study investigates how the reliability of likelihood ratio (LR)-based forensic text comparison (FTC) systems is affected by sampling variability arising from the number of authors in the databases. When 30–40 authors (each contributing two 4 kB documents) are included in each of the test, reference and calibration databases, the experimental results demonstrate that: 1) the overall performance (validity) of the FTC system reaches the same level as that of a system with 720 authors, and 2) the variability of the system performance (reliability) starts to converge. A similar trend can be observed in the magnitude of fluctuation in the derived LRs. The variability of the overall system performance is mostly due to large variability in calibration rather than in discrimination. Furthermore, FTC systems are more prone to instability when the dimension of the feature vector is high.
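    The abstract does not name the metric used to measure overall performance. As an illustration only, the sketch below computes the log-likelihood-ratio cost (Cllr), a metric commonly used to evaluate LR-based forensic systems; its use here is an assumption, not a statement of the authors' method. The function name `cllr` and all LR values are hypothetical.

```python
import math


def cllr(same_author_lrs, different_author_lrs):
    """Log-likelihood-ratio cost (assumed metric, not confirmed by the abstract).

    0 is a perfect system; 1 is what a system that always outputs LR = 1 scores.
    """
    # Same-author comparisons are penalised when the LR is small.
    same_penalty = sum(math.log2(1 + 1 / lr) for lr in same_author_lrs) / len(same_author_lrs)
    # Different-author comparisons are penalised when the LR is large.
    diff_penalty = sum(math.log2(1 + lr) for lr in different_author_lrs) / len(different_author_lrs)
    return 0.5 * (same_penalty + diff_penalty)


# Hypothetical LRs for illustration only; these are not results from the paper.
uninformative = cllr([1.0, 1.0], [1.0, 1.0])
well_behaved = cllr([100.0, 50.0], [0.01, 0.02])
print(uninformative, well_behaved)
```

    Under this assumption, repeating the calculation over many random draws of authors for the test, reference and calibration databases, and looking at the spread of the resulting values, would be one way to quantify the kind of variability the abstract describes.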
    Original language: English
    Publication status: Published - 2022
    Event: The 20th Annual Workshop of the Australasian Language Technology Association - Adelaide, SA
    Duration: 1 Jan 2022 → …


    Conference: The 20th Annual Workshop of the Australasian Language Technology Association
    Period: 1/01/22 → …

