Implementing optimal settings for adult formant measurement: With reference to Praat software

The implementation of Korean adult's optimal formant setting by Praat scripting

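  • Jiyeon Park (박지연; Department of Speech-Language Pathology, Chungnam National University) ;
  • Cheoljae Seong (성철재; Department of Linguistics, Chungnam National University)
  • Received : 2019.10.09
  • Reviewed : 2019.11.21
  • Published : 2019.12.31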

Abstract

An automated Praat script was implemented to carry out optimal formant analysis for Korean adults. Here, optimal formant analysis means choosing, among combinations of the two setting parameters used in Praat's formant analysis (maximum formant and number of formants), the combination for which the summed deviation of the measured first and second formants (F1, F2) is minimal. To increase the reliability of formant analysis, the LPC order should be set differently depending on gender and vowel type. The Praat manual recommends a maximum formant of 5,000 Hz for males and 5,500 Hz for females, with 5 as the number of formants for both. However, whether these recommended settings are also valid for Korean vowels needs to be verified. When the four scripts implemented in this study were applied, the formant scatter plots for each vowel showed that the range of formant variation differed markedly across scripts, especially for females; statistical analysis likewise showed that formant frequencies varied significantly across the scripts. Of the four scripts, the formant plots and statistical results indicated that linear_script and qtone_script were the more stable and reliable in formant measurement. In linear_script the formant step increases linearly within a for-loop, whereas in qtone_script the step follows a quarter-tone scale (base frequency × common ratio $\sqrt[24]{2}$). In the optimal settings selected by these two scripts, the front vowels [i, e] tended to receive a higher maximum formant and a smaller number of formants than Praat recommends, whereas the back vowels [o, u] tended to receive a lower maximum formant and a larger number of formants.
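The search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Praat script. The function names, the `measure` callback, the candidate ranges, and the use of standard deviation as the "deviation" are all assumptions made for illustration; the abstract states only that the summed F1 and F2 deviation is minimized and that the step is either linear or a quarter tone ($\sqrt[24]{2}$).

```python
import statistics

# Common ratio of a quarter-tone scale: 24 steps per octave.
QUARTER_TONE = 2 ** (1 / 24)


def linear_ceilings(start_hz, stop_hz, step_hz):
    """Candidate maximum-formant values spaced by a constant step in Hz."""
    values = []
    ceiling = float(start_hz)
    while ceiling <= stop_hz:
        values.append(ceiling)
        ceiling += step_hz
    return values


def qtone_ceilings(start_hz, stop_hz):
    """Candidate maximum-formant values, each one quarter tone above the last."""
    values = []
    ceiling = float(start_hz)
    while ceiling <= stop_hz:
        values.append(ceiling)
        ceiling *= QUARTER_TONE
    return values


def deviation_sum(frames):
    """Assumed criterion: SD of F1 plus SD of F2 across analysis frames.

    `frames` is a sequence of (F1, F2) pairs in Hz.
    """
    f1 = [frame[0] for frame in frames]
    f2 = [frame[1] for frame in frames]
    return statistics.pstdev(f1) + statistics.pstdev(f2)


def pick_optimal(measure, ceilings, formant_counts):
    """Grid-search both settings and keep the pair with the smallest score.

    `measure(ceiling, count)` is a hypothetical callback that runs the
    formant analysis with the given settings and returns (F1, F2) pairs.
    Returns a tuple (score, ceiling, count).
    """
    best = None
    for ceiling in ceilings:
        for count in formant_counts:
            score = deviation_sum(measure(ceiling, count))
            if best is None or score < best[0]:
                best = (score, ceiling, count)
    return best
```

In use, `measure` would wrap a real formant tracker, and `pick_optimal` would be called once per vowel token with either `linear_ceilings(...)` or `qtone_ceilings(...)` as the candidate list. The quarter-tone ladder spaces candidates evenly on a logarithmic frequency axis, so the steps in Hz are smaller at low ceilings and larger at high ones.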
