The implementation of Korean adult's optimal formant setting by Praat scripting

Park, Jiyeon;Seong, Cheoljae;

doi:10.13064/KSSS.2019.11.4.097

말소리와 음성과학 (Phonetics and Speech Sciences)

제11권4호
/
Pages.97-108
/
2019
/
2005-8063(pISSN)
/
2586-5854(eISSN)

한국음성학회 (Korean Society of Speech Sciences)

DOI QR Code

성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여

The implementation of Korean adult's optimal formant setting by Praat scripting

박지연 (충남대학교 언어병리학과) ;
성철재 (충남대학교 언어학과)

Park, Jiyeon (Speech-Language Pathology, Chungnam National University) ;
Seong, Cheoljae (Linguistics, Chungnam National University)

투고 : 2019.10.09
심사 : 2019.11.21
발행 : 2019.12.31

https://doi.org/10.13064/KSSS.2019.11.4.097 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

한국인 성인을 대상으로 최적의 포먼트 분석이 가능하도록 자동화된 프랏 스크립트를 구현하였다. 최적의 포먼트 분석이란 프랏에서 포먼트 분석 시 설정하는 2가지 세팅 파라미터(최대 포먼트, 포먼트 개수)를 조합하여 측정된 제1, 제2 포먼트의 편차합이 최소일 때를 가리킨다. 포먼트 분석의 신뢰성을 높이기 위해서는 성별이나 모음의 종류에 따라 LPC 차수를 다르게 설정해야 하는데 프랏 매뉴얼에서는 최대 포먼트 설정 값으로 남성 5,000 Hz, 여성 5,500 Hz, 측정개수는 5개를 권고한다. 그러나 이렇게 권고된 포먼트 세팅 설정이 한국어 모음에 대해서도 타당한지 검증이 필요하다. 본 연구에서 구현한 4가지 스크립트를 적용한 결과, 각 모음별 포먼트 산점도로 확인하였을 때 특히 여성의 경우 스크립트에 따라 측정된 포먼트 변이의 폭이 두드러지는 차이를 보였다. 포먼트 산점도와 통계 결과를 통해 linear_script와 qtone_script가 포먼트 측정에서 더 신뢰성이 높은 것을 알 수 있었다. Linear_script, qtone_script에서 최적의 세팅으로 설정된 최대 포먼트와 포먼트 개수의 데이터 경향성을 살펴보면, 전설 모음 [이, 에]의 경우 권고 설정보다 최대 포먼트 값은 높게, 포먼트 개수의 값은 적게 설정되었다. 반면 후설모음 [오, 우]의 경우, 권고 설정보다 최대 포먼트 값은 낮게, 포먼트 개수의 값은 많게 설정되는 것을 확인할 수 있었다.

An automated Praat script was implemented to measure optimal formant frequencies for adults. Optimal formant analysis could be interpreted to show that the deviation of formant frequency that resulted from the two variously combined setting parameters (maximum formant and number of formants) was minimal. To increase the reliability of formant analysis, LPC order should be set differently, based on the gender or vowel type. Praat recommends 5,000 Hz and 5,500 Hz as maximum formant settings and, at the same time, recommends 5 as the number of formants for males and females. However, verification is needed to determine whether these recommended settings are valid for Korean vowels. Statistical analysis showed that formant frequencies significantly varied across the adapted scripts, especially with respect to the data on females. Formant plots and statistical results showed that linear_script and qtone_script are much more reliable in formant measurements. Among four kinds of scripts, the linear and qtone_scripts proved to be more stable and reliable. While the linear_script was designed to have a linearly increased formant step in for-loop, the increment of formant step in the qtone_script was arranged by quarter tone scale (base frequency×common ratio ($\sqrt[24]{2}$)). When looking at the tendency of the formant setting drawn by the two referred algorithms in the context of front vowel [i, e], the maximum formant was set higher; and the number of formants set at a lower value than recommended by Praat. The back vowel [o, u], on the contrary, has a lower maximum formant and a higher number of formants than the standard setting.

키워드

참고문헌

Bae, J. (2003). The pronunciation of Korean. Seoul: Samgyeong.
Behrman, A. (2007). Speech and voice science. Oxford: Plural Pub.
Boersma, P., & Weenink, D. (2014). Praat: Doing phonetics by computer [Computer program]. Retrieved from http://www.praat.org
Chiders, D. G. (1978). Modern spectrum analysis (pp. 252-255). New York: IEEE Press.
Escudero, P., Boersma, P., Rauber, A. S., & Bion, R. A. H. (2009). A cross-dialect acoustic description of vowels: Brazilian and European Portuguese. Journal of Acoustical Society of America, 126(3), 1379-1393. https://doi.org/10.1121/1.3180321
Fry, D. B. (1982). The physics of speech. Cambridge: Cambridge University Press.
Jin, S. M. (2004). Introduction of acoustic analysis of voice. Korean Journal of Otolaryngology-Head and Neck Surgery, 47(10), 943-949.
Kent, R. D., & Read, C. (2002). The acoustic analysis of speech (2nd ed.). New York: Thomson Learning.
Kim, E. (2012). (A) Study on native Chinese speaker's acquisition of Korean vowels (Master's thesis). Chungnam National University, Daejeon, Korea.
Kim, J. (2015). Comparison of phonetic characteristics of vowel pronunciation by children who grew up in a multi-lingual (Chinese-Korean) environment (Master's thesis). Chungnam National University, Daejeon, Korea.
Kim, J., & Seong, C. (2016). The change of vowel characteristics for the Dysarthric speech along with speaking style. Phonetics and Speech Sciences, 8(3), 51-59. https://doi.org/10.13064/KSSS.2016.8.3.051
Lee, J. R. (2017). Feasibility of acoustic analysis with laryngoscopic examination at outpatient clinic (Master's thesis). Ulsan National University, Ulsan, Korea.
Park, J., & Seong, C. (2018) The implementation of children’s automated fromant setting by Praat scripting. Phonetics and Speech Sciences, 10(4), 1-10. https://doi.org/10.13064/KSSS.2018.10.4.001
Park, S. (2008). A validity study of formant analysis of vowel errors of children: Focused on vowel /u/. Journal of Speech & Hearing Disorders, 17(3), 117-131.
Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. (1992). Numerical recipes in C: The art of scientific computing. (2nd ed.). Cambridge: Cambridge University Press.
Sapir, S., Ramig, L. O., Spielman, J. L., & Fox, C. (2010). Formant centralization ratio: A proposal for a new acoustic measure of dysarthric speech. Journal of Speech, Language, and Hearing Research, 53(1), 114-125. https://doi.org/10.1044/1092-4388(2009/08-0184)
Seong, C. J. (2005). A formant analysis of the Korean monophthongs of the college students speaking Chungnam dialect, Language, 43, 189-213.
Seong, C. J., Kwon, O. W., Lee, J. H., & Gim, C. G. (2008). A tonal analysis of East-Southern Gyeongnam dialect using Q-tone perceptual sense grade. Hangeul, 279, 5-33.
Song, I., & Seong, C. (2018). Characteristics of 2 to 4 year old Korean children's production of monophthongs and diphthongs. Phonetics and Speech Sciences, 10(1), 65-74. https://doi.org/10.13064/KSSS.2018.10.1.065
Sim, H., Choi, C., & Choi, S. (2016). Characteristics of vowel formants, vowel space, and speech intelligibility produced by children aged 3-6 years. Audiology and Speech Research, 12(4), 260-268. https://doi.org/10.21848/asr.2016.12.4.260
Yang, B. (2019). A comparison of normalized formant trajectories of English vowels produced by American men and women. Phonetics and Speech Sciences, 11(1), 1-8. https://doi.org/10.13064/KSSS.2019.11.1.001
Yoon, T. J., & Kang, Y. (2014). Monophthong analysis on a largescale speech corpus of read-style Korean. Phonetics and Speech Saciences, 6(3), 139-145. https://doi.org/10.13064/KSSS.2014.6.3.139

말소리와 음성과학 (Phonetics and Speech Sciences)

성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여

The implementation of Korean adult's optimal formant setting by Praat scripting

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)