Automated scoring of speaking items in an assessment for teachers of English as a Foreign Language

This paper describes an end-to-end prototype system for automated scoring of spoken responses in a novel assessment for teachers of English as a Foreign Language who are not native speakers of English. The 21 speaking items contained in the assessment elicit both restricted and moderately restricted responses, and their aim is to assess the essential speaking skills that English teachers need in order to be effective communicators in their classrooms. Our system consists of a state-of-the-art automatic speech recognizer; multiple feature generation modules addressing diverse aspects of speaking proficiency, such as fluency, pronunciation, prosody, grammatical accuracy, and content accuracy; a filter that identifies and flags problematic responses; and linear regression models that predict response scores based on subsets of the features. The automated speech scoring system was trained and evaluated on a data set involving about 1,400 test takers, and achieved a speaker-level correlation (when scores for all 21 responses of a speaker are aggregated) with human expert scores of 0.73.

[1]  Jian Cheng,et al.  Validating automated speaking tests , 2010 .

[2]  Vassilios Digalakis,et al.  Combination of machine scores for automatic grading of pronunciation quality , 2000, Speech Commun..

[3]  Su-Youn Yoon,et al.  Vocabulary Profile as a Measure of Vocabulary Sophistication , 2012, BEA@NAACL-HLT.

[4]  Xiaoming Xi,et al.  Improved pronunciation features for construct-driven assessment of non-native spontaneous speech , 2009, HLT-NAACL.

[5]  Su-Youn Yoon,et al.  Assessment of ESL Learners' Syntactic Competence Based on Similarity Measures , 2012, EMNLP-CoNLL.

[6]  Maxine Eskénazi,et al.  An overview of spoken language technology for education , 2009, Speech Commun..

[7]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[8]  Xiaoming Xi,et al.  Automatic scoring of non-native spontaneous speech in tests of spoken English , 2009, Speech Commun..

[9]  Helmer Strik,et al.  Automatic evaluation of Dutch pronunciation by using speech recognition technology , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[10]  L. Boves,et al.  Quantitative assessment of second language learners' fluency by means of automatic speech recognition technology. , 2000, The Journal of the Acoustical Society of America.

[11]  Xiaoming Xi,et al.  A three-stage approach to the automated scoring of spontaneous spoken responses , 2011, Comput. Speech Lang..

[12]  Jian Cheng,et al.  Fluency and structural complexity as predictors of L2 oral proficiency , 2010, INTERSPEECH.

[13]  Mitch Weintraub,et al.  Automatic scoring of pronunciation quality , 2000, Speech Commun..

[14]  Xiaoming Xi,et al.  Evaluating prosodic features for automated scoring of non-native read speech , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[15]  Jack Mostow,et al.  A Prototype Reading Coach that Listens , 1994, AAAI.

[16]  Su-Youn Yoon,et al.  Application of Structural Events Detected on ASR Outputs for Automated Speaking Assessment , 2012, INTERSPEECH.

[17]  Xiaoming Xi,et al.  AUTOMATED SCORING OF SPONTANEOUS SPEECH USING SPEECHRATERSM V1.0 , 2008 .

[18]  Klaus Zechner,et al.  Computing and Evaluating Syntactic Complexity Features for Automated Scoring of Spontaneous Non-Native Speech , 2011, ACL.

[19]  Klaus Zechner,et al.  Exploring Content Features for Automated Speech Scoring , 2012, HLT-NAACL.

[20]  Ashish Verma,et al.  Sensei: Spoken language assessment for call center agents , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[21]  Su-Youn Yoon,et al.  Non-scorable Response Detection for Automated Speaking Proficiency Assessment , 2011, BEA@ACL.

[22]  L. Boves,et al.  Quantitative assessment of second language learners' fluency: comparisons between read and spontaneous speech. , 2002, The Journal of the Acoustical Society of America.

[23]  Silke M. Witt,et al.  Use of speech recognition in computer-assisted language learning , 2000 .

[24]  F. June Automatic Assessment of Non-Native Prosody – Annotation , Modelling and Evaluation , 2012 .

[25]  Klaus Zechner,et al.  Automated Content Scoring of Spoken Responses in an Assessment for Teachers of English , 2013, BEA@NAACL-HLT.

[26]  Xiaoming Xi,et al.  Automated Scoring of Spontaneous Speech Using SpeechRater? v1.0. Research Report. ETS RR-08-62. , 2008 .

[27]  P. Boersma Praat : doing phonetics by computer (version 5.1.05) , 2009 .

[28]  Su-Youn Yoon,et al.  Acoustic Feature-based Non-scorable Response Detection for an Automated Speaking Proficiency Assessment , 2012, INTERSPEECH.