DOI QR코드

DOI QR Code

Speech recognition rates and acoustic analyses of English vowels produced by Korean students

  • Yang, Byunggon (Department of English Education, Pusan National University)
  • Received : 2022.04.27
  • Accepted : 2022.06.07
  • Published : 2022.06.30

Abstract

English vowels play an important role in verbal communication. However, Korean students tend to experience difficulty pronouncing a certain set of vowels despite extensive education in English. The aim of this study is to apply speech recognition software to evaluate Korean students' pronunciation of English vowels in minimal pair words and then to examine acoustic characteristics of the pairs in order to check their pronunciation problems. Thirty female Korean college students participated in the recording. Speech recognition rates were obtained to examine which English vowels were correctly pronounced. To compare and verify the recognition results, such acoustic analyses as the first and second formant trajectories and durations were also collected using Praat. The results showed an overall recognition rate of 54.7%. Some students incorrectly switched the tense and lax counterparts and produced the same vowel sounds for qualitatively different English vowels. From the acoustic analyses of the vowel formant trajectories, some of these vowel pairs were almost overlapped or exhibited slight acoustic differences at the majority of the measurement points. On the other hand, statistical analyses on the first formant trajectories of the three vowel pairs revealed significant differences throughout the measurement points, a finding that requires further investigation. Durational comparisons revealed a consistent pattern among the vowel pairs. The author concludes that speech recognition and analysis software can be useful to diagnose pronunciation problems of English-language learners.

Keywords

References

  1. Boersma, P., & Weenink, D. (2021). Praat: Doing phonetics by computer (version 6.2) [Computer program]. Retrieved from http://www.praat.org/
  2. Davenport, M., & Hannahs, S. J. (1998). Introducing phonetics and phonology. London, UK: Hodder Arnold.
  3. De Decker, P. M., & Nycz, J. R. (2012). Are tense [ae]s really tense? The mapping between articulation and acoustics. Lingua, 122(7), 810-821. https://doi.org/10.1016/j.lingua.2012.01.003
  4. Delattre, P. C., Liberman, A. M., & Cooper, F. S. (1955). Acoustic loci and transitional cues for consonants. Journal of the Acoustical Society of America, 27(4), 769-773. https://doi.org/10.1121/1.1908024
  5. Kennedy, R. (2022). The phonetics/phonology interface. In R. A. Knight, & J. Setter (Eds.), The Cambridge handbook of phonetics (pp. 682-706). Cambridge, UK: Cambridge University Press.
  6. Lee, S., & Rhee, S. C. (2019). The relationship between vowel production and proficiency levels in L2 English produced by Korean EFL learners. Phonetics and Speech Sciences, 11(2), 1-13. https://doi.org/10.13064/ksss.2019.11.2.001
  7. Millett, P. (2021). Accuracy of speech-to-text captioning for students who are deaf or hard of hearing. Journal of Educational, Pediatric & (Re)Habilitative Audiology, 25, 1-13.
  8. Nearey, T. (2006). English vowels. Linguistics 205 course notes of practical phonetics. Retrived from https://sites.ualberta.ca/~tnearey/Ling205/Week4/EnglishVowelsNarrow4Up.pdf
  9. Pickett, J. M. (1980). The sounds of speech communication: A primer of acoustic phonetics and speech perception (Perspectives in Audiology Series). Baltimore, MD: University Park Press.
  10. R Core Team. (2021). R: A language and environment for statistical computing (version 4.1.0) [Computer software]. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/
  11. Soskuthy, M. (2017). Generalised additive mixed models for dynamic analysis in linguistics: A practical introduction. Retrieved from https://arxiv.org/abs/1703.05339v1
  12. van Rij, J. (2015). Overview of GAMM analysis of time series data. Retrieved from https://jacolienvanrij.com/Tutorials/GAMM.html
  13. Weckwerth, J. (2022). Vowels. In R. A. Knight, & J. Setter (Eds.), The Cambridge handbook of phonetics (pp. 40-64). Cambridge, UK: Cambridge University Press.
  14. Wood, S. N. (2006). Generalised additive models: An introduction with R. Boca Raton, FL: CRC Press.
  15. Yang, B. (1990). Development of vowel normalization procedures: English and Korean (Doctoral dissertation). The University of Texas, Austin, TX.
  16. Yang, B. (1996). A comparative study of American English and Korean vowels produced by male and female speakers. Journal of Phonetics, 24(2), 245-261. https://doi.org/10.1006/jpho.1996.0013
  17. Yang, B. (2006). Discrimination of synthesized English vowels by American and Korean listeners. Speech Sciences, 13(1), 7-27.
  18. Yang, B. (2009). Formant trajectories of English vowels produced by American males. Phonetics and Speech Sciences, 1(3), 65-72.
  19. Yang, B. (2010). Formant trajectories of English high tense and lax vowels produced by Korean and American speakers. Korean Journal of Linguistics, 35(2), 407-423. https://doi.org/10.18855/lisoko.2010.35.2.005
  20. Yang, B. (2017). Google speech recognition of an English paragraph produced by college students in clear or casual speech styles. Phonetics and Speech Sciences, 9(4), 43-50. https://doi.org/10.13064/KSSS.2017.9.4.043
  21. Yang, B. (2019). A comparison of normalized formant trajectories of English vowels produced by American men and women. Phonetics and Speech Sciences, 11(1), 1-8. https://doi.org/10.13064/KSSS.2019.11.1.001
  22. Yang, B. (2020). An evaluation of Korean students' pronunciation of an English passage by a speech recognition application and two human raters. Phonetics and Speech Sciences, 12(4), 19-25. https://doi.org/10.13064/KSSS.2020.12.4.019
  23. Yang, B. (2022). Measuring vowels. In R. A. Knight, & J. Setter (Eds.), The Cambridge handbook of phonetics (pp. 261-284). Cambridge, UK: Cambridge University Press.
  24. Yang, B., & Whalen, D. H. (2015). Perception and production of English vowels by American males and females. Australian Journal of Linguistics, 35(2), 121-141. https://doi.org/10.1080/07268602.2015.1004998