Analyzing vowel variation in Korean dialects using phone recognition

  • Jooyoung Lee (Department of Linguistics, Seoul National University)
  • Sunhee Kim (Department of French Language Education, Seoul National University)
  • Minhwa Chung (Department of Linguistics, Seoul National University)
  • Received : 2023.11.22
  • Accepted : 2023.12.11
  • Published : 2023.12.31

Abstract

This study proposes an automatic method for detecting vowel variation in the Gyeong-sang and Jeol-la dialects of Korean. The method is based on error patterns extracted using phone recognition. Canonical and recognized phone sequences are compared, and statistical analyses separate the dialect-common vowels, whose variation appears in both dialects, from the vowels with high mismatch rates in each individual dialect. The dialect-common vowels show monophthongization of diphthongs. The dialect-specific patterns are /we/ to [e] and /ʌ/ to [ɰ] in the Gyeong-sang dialect, and /ɰi/ to [ɯ] in the Jeol-la dialect. These results corroborate previous dialectology reports on the phonetic realization of the Korean dialects. The proposed method suggests that dialect patterns can be described automatically.
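The comparison of canonical and recognized phone sequences can be illustrated with a minimal sketch. It assumes a plain Levenshtein alignment and a per-vowel mismatch count; the function names `align` and `vowel_mismatch_rates`, the vowel set, and the toy phone sequences are hypothetical and are not taken from the paper. The statistical analysis the authors apply to the mismatch rates is not reproduced here.

```python
from collections import Counter


def align(canonical, recognized):
    """Levenshtein-align two phone lists.

    Returns (canonical_phone, recognized_phone) pairs; None marks a
    deletion (no recognized phone) or an insertion (no canonical phone).
    """
    n, m = len(canonical), len(recognized)
    # cost[i][j] = edit distance between canonical[:i] and recognized[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = cost[i - 1][j - 1] + (canonical[i - 1] != recognized[j - 1])
            cost[i][j] = min(sub, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Backtrace from the bottom-right corner to recover the aligned pairs.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and cost[i][j]
                == cost[i - 1][j - 1] + (canonical[i - 1] != recognized[j - 1])):
            pairs.append((canonical[i - 1], recognized[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            pairs.append((canonical[i - 1], None))  # deletion
            i -= 1
        else:
            pairs.append((None, recognized[j - 1]))  # insertion
            j -= 1
    pairs.reverse()
    return pairs


def vowel_mismatch_rates(utterances, vowels):
    """Compute, per canonical vowel, the share of tokens not recognized as itself."""
    totals, mismatches, confusions = Counter(), Counter(), Counter()
    for canonical, recognized in utterances:
        for ref, hyp in align(canonical, recognized):
            if ref in vowels:
                totals[ref] += 1
                if hyp != ref:
                    mismatches[ref] += 1
                    confusions[(ref, hyp)] += 1
    rates = {v: mismatches[v] / totals[v] for v in totals}
    return rates, confusions


# Toy data, invented for illustration only (not results from the study).
toy_utterances = [
    (["k", "we", "s", "a"], ["k", "e", "s", "a"]),
    (["we", "n"], ["we", "n"]),
]
rates, confusions = vowel_mismatch_rates(toy_utterances, vowels={"we", "e", "a"})
print(rates)
print(confusions.most_common())
```

Running the same counts separately on each dialect's data would give per-dialect mismatch rates that could then be compared, which is the kind of contrast the abstract describes.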

Acknowledgement

This work was supported by an Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korean government (MSIT) (No. 2021-0-00575, "Development of Voice Phishing Prevention Technology Based on Speech and Text Deep Learning").
