DOI QR코드

DOI QR Code

A study on the release burst spectra of the voiceless plosives from the English and Korean spontaneous speech corpus

영어와 한국어 자연발화 코퍼스에서의 무성 폐쇄음 개방 파열 스펙트럼 연구

  • Received : 2017.10.11
  • Accepted : 2017.12.07
  • Published : 2017.12.31

Abstract

The purpose of this work is to examine the English and Korean voiceless plosives from the Buckeye[15] and Seoul[16] corpus in terms of their static spectral characteristics. The plosives were automatically extracted by a Praat script. In order to estimate the percent correctness in the classification of the plosives, discriminant analyses were performed whose trainings were based on four spectral moments, i.e. the center of gravity, variance, skewness and kurtosis as suggested in [6]. Another set of discriminant analyses were performed based on the spectral tilts. In the last set of analyeses, the spectral moments and tilts were both used in the training. Results showed that the correct classification rate did not exceed around 65% in the best case, which suggested that phonetic cues other than the release burst would be necessary including the dynamic spectral aspects and vowel-onset cues.

Keywords

References

  1. Cooper, F., Delattre, P., Liberman, A., Borst, J., & Gerstman, L. (1952). Some experiments on the perception of synthetic speech sounds. The Journal of the Acoustical Society of America, 24, 597-606. https://doi.org/10.1121/1.1906940
  2. Halle, M., Hughes, G., & Radley, J. (1957). Acoustic properties of stop consonants. The Journal of the Acoustical Society of America, 29, 107-116. https://doi.org/10.1121/1.1908634
  3. Stevens, K., & Blumstein, S. (1975). Quantal aspects of consonant production and perception: A study of retroflex stop consonants. Journal of Phonetics, 3, 215-233.
  4. Stevens, K., & Blumstein, S. (1978). Invariant cues for the place of articulation in stop consonants. The Journal of the Acoustical Society of America, 64, 1358-1368. https://doi.org/10.1121/1.382102
  5. Blumstein, S., & Stevens, K. (1979). Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants. The Journal of the Acoustical Society of America, 66, 1001-1017. https://doi.org/10.1121/1.383319
  6. Forrest, K., Weismer, G., Milenkovic, P., & Dougall, R. (1988). Statistical analysis of word-initial voiceless obstruents: preliminary data. The Journal of the Acoustical Society of America, 84, 115-123. https://doi.org/10.1121/1.396977
  7. Winitz, H., Scheib, M., & Reeds, J. (1972). Identification of stops and vowels for the burst portion of the /p, t, k/ isolated from conversational speech. The Journal of the Acoustical Society of America, 51, 1309-1317. https://doi.org/10.1121/1.1912976
  8. Cole, R., & Scott, B. (1974). Toward a theory of speech perception. Psychological Review, 81, 348-374. https://doi.org/10.1037/h0036656
  9. Ohde, R., & Sharf, D. (1977). Order effect of acoustic segments of VC and CV syllables on stop and vowel identification. Journal of Speech and Hearing Research, 20, 543-554. https://doi.org/10.1044/jshr.2003.543
  10. Kewley-Port, D. (1983). Time-varying features as correlates of place of articulation in stop consonants. The Journal of the Acoustical Society of America, 73, 322-335. https://doi.org/10.1121/1.388813
  11. Kewley-Port, D. (1983). Measurement of formant transitions in naturally produced stop consonant-vowel syllables. The Journal of the Acoustical Society of America, 72, 379-389.
  12. Bonneau, A., Djezzar, L., & Laprie, Y. (1996). Perception of the place of articulation of French stop bursts. The Journal of the Acoustical Society of America, 100, 555-564. https://doi.org/10.1121/1.415866
  13. Smits, R., ten Bosch, L., & Collier, R. (1996). Evaluation of various sets of acoustic cues for the perception of prevocalic stop consonants. I. Perception experiment. The Journal of the Acoustical Society of America, 100, 3582-3864.
  14. Park, H. (2003). Spectral characteristics of release bursts. Proceedings of the Korean Society of Speech Sciences (pp. 159-162). (박한상 (2003). 개방 파열의 스펙트럼상의 특성. 대한음성학회 학술대회 논문집, 159-162.)
  15. Pitt, M., Dilley, L., Johnson, K., Kiesling, S., Raymond, W., Hume, E., & Fosler-Lussier, E. (2007). Buckeye Corpus of Conversational Speech (2nd release). [www.buckeyecorpus.osu.edu] Columbus, OH: Department of Psychology, Ohio State University (Distributor).
  16. Yun, W., Yoon, K., Park, S., Lee, J., Cho, S., Kang, D., Byun, K., Hahn, H., & Kim, J. (2015). The Korean corpus of spontaneous speech. Phonetics and Speech Sciences, 7(2), 103-109. https://doi.org/10.13064/KSSS.2015.7.2.103
  17. Boersma, P. (2001). Praat, a system for doing phonetics by computer. Glot International, 5(9/10), 341-345.
  18. R Studio Team. (2015). RStudio: Integrated Development for R. RStudio, Inc., Boston, MA. Retrieved from http://www.rstudio.com/ on March 31, 2016.
  19. Steriade, D. (1999). Phonetics in Phonology: The Case of Laryngeal Neutralization. UCLA Working Papers in Linguistics, 2, 25-146.