TY - JOUR
T1 - Australian English Bilingual Corpus: Automatic forced-alignment accuracy in Russian and English
AU - Gnevsheva, Ksenia
AU - Gonzalez Ochoa, Simon
AU - Fromont, Robert
PY - 2020
Y1 - 2020
N2 - This paper introduces the Australian English Bilingual Corpus, a Russian–English spoken corpus, and uses it for a comparison of automatic time alignment between two different languages. Automatic forced alignment is gaining popularity in corpus research as it allows for time-efficient processing of phonetic information. The Language, Brain and Behaviour: Corpus Analysis Tool is one aligner which compares well with others in terms of alignment accuracy. Most of the forced-alignment work has been done with different varieties of English. This paper compares alignment accuracy between Russian and English and discusses aligner settings and data characteristics that affect it. The results suggest higher alignment accuracy for English than Russian. For Russian, alignment accuracy improves with stress specification; that is, when stressed and unstressed vowels are treated as separate categories.
U2 - 10.1080/07268602.2020.1737507
DO - 10.1080/07268602.2020.1737507
M3 - Article
SN - 0726-8602
VL - 40
SP - 182
EP - 193
JO - Australian Journal of Linguistics
JF - Australian Journal of Linguistics
IS - 2
ER -