Developing ASR for Indonesian-English Bilingual Language Teaching

Zara Maxwell-Smith, Ben Foley

    Research output: Contribution to conferencePaper


    Usage-based analyses of teacher corpora and code-switching (Boztepe, 2003) are an important next stage in understanding language acquisition. Multilingual corpora are difficult to compile and a classroom setting adds pedagogy to the mix of factors which make this data so rich and problematic to classify. Using quantitative methods to understand language learning and teaching is difficult work as the ‘transcription bottleneck’ constrains the size of datasets. We found that using an automatic speech recognition (ASR) toolkit with a small set of training data is likely to speed data collection in this context (Maxwelll-Smith et al., 2020).
    Original languageEnglish
    Publication statusPublished - 2021
    EventComputational Approaches to Linguistic Code-Switching - Mexico City
    Duration: 1 Jan 2021 → …


    ConferenceComputational Approaches to Linguistic Code-Switching
    Period1/01/21 → …


    Dive into the research topics of 'Developing ASR for Indonesian-English Bilingual Language Teaching'. Together they form a unique fingerprint.

    Cite this