On the adequacy of L2 pronunciation feedback from automatic speech recognition: A focus on Google Translate


Paul John, Walcir Cardoso, Carol Johnson (2022)

In: Proceedings of the XXIst International CALL Research Conference (Chapter 20)

DOI: 10.29140/9781914291050-20


Abstract

This study investigates automatic speech recognition (ASR) in Google Translate as a source for L2 pronunciation feedback. To be effective, ASR should transcribe learner errors accurately and perform equally well on male and female voices, avoiding gender bias. We assess Google Translate on three Quebec francophone (QF) segmental errors in English: th-substitution ( think → [t] ink ); h-deletion ( happy → appy ); and h-epenthesis ( ice → [h] ice ). Eight QFs (4F/4M) recorded 120 sentences with and without an error on the final item (e.g., I don’t know who to *tank/thank ). Errors were equally divided between real word output (* tank ) and nonword output (e.g., My sister is afraid of *tunder ). We anticipate real word errors, corresponding to entries in the Google Translate lexicon, will be accurately transcribed, whereas nonwords, by definition absent from the lexicon, should be erroneously matched to similar-sounding real words (i.e., the intended output “thunder”), constituting misleading feedback. Forthcoming data analyses will determine the relative contribution of error type, real/nonword output, and gender to final-word transcription and feedback accuracy. Preliminary findings suggest a hierarchy of accuracy (h-deletion, h-epenthesis ˃th-substitution) specific to real-word output. Indeed, ASR shows a clear inability to flag nonword errors. A gender bias effect is not apparent; in fact, ASR generally transcribed the sentences recorded by females more accurately. Mistranscriptions unrelated to final items have yet to be examined. Our presentation will address the implications of our findings for L2 teachers/learners and for developers seeking to design ASR specifically for L2 uses.

Suggested citation

Paul John, Walcir Cardoso, Carol Johnson. (2022). On the adequacy of L2 pronunciation feedback from automatic speech recognition: A focus on Google Translate. In J. Colpaert, Y. Wang, & G. Stockwell (Eds.), Proceedings of the XXIst International CALL Research Conference (pp. 146–154). Castledown Publishers. https://doi.org/10.29140/9781914291050-20



Looking to publish your next academic book?


Related Publications

Online Language Teaching: Crises and Creativities

Online Language Teaching: Crises and Creativities

Edited by Ursula Stickler & Martina Emke (2023) Online Language Teaching: Crises and Creativities collects…
Insights into Teaching and Learning Writing

Insights into Teaching and Learning Writing

Edited by Hassan Mohebbi  & Yijen Wang  (2023) Writing is one of the most challenging skills…
Insights into Autonomy and Technology in Language Teaching

Insights into Autonomy and Technology in Language Teaching

Chun Lai  (2023) Drawing on the literature and research findings from relevant research fields, including…
Insights into Flipped Classrooms

Insights into Flipped Classrooms

Adrian Leis  (2023) This book is aimed at those interested in the flipped learning model as…
Insights into Task-Based Language Teaching

Insights into Task-Based Language Teaching

Sima Khezrlou  (2022) This book aims to offer a unique contribution to the expanding literature on…
Proceedings of the XXIst International CALL Research Conference

Proceedings of the XXIst International CALL Research Conference

Edited by Jozef Colpaert , Yijen Wang , Glenn Stockwell  (2022) The theme of the XXIst International CALL…
Insights into Professional Development in Language Teaching

Insights into Professional Development in Language Teaching

Thomas S.C. Farrell  (2022) From entering a teacher education program for initial qualifications onwards, language teachers…
Smart CALL: Personalization, Contextualization, & Socialization

Smart CALL: Personalization, Contextualization, & Socialization

Edited by Jozef Colpaert , Glenn Stockwell  (2022) The contributions in this volume focus on a…
%d