HMM-based missing feature reconstruction for robust speech recognition in additive noise environments

  • Received : 2014.11.04
  • Accepted : 2014.12.13
  • Published : 2014.12.31

Abstract

This paper describes a robust speech recognition technique that reconstructs spectral components mismatched with the training environment. While the cluster-based reconstruction method compensates for unreliable components using the reliable components of the same spectral vector, under the assumption that training spectral vectors follow an independent, identically distributed Gaussian-mixture process, the presented method exploits the temporal dependency of speech by introducing a hidden-Markov-model prior that incorporates internal state transitions plausible for the observed spectral vector sequence. The experimental results indicate that the described method provides temporally consistent reconstruction and, on average, further improves recognition performance over the conventional method.
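The following is a minimal, hypothetical sketch (in Python with NumPy, not the authors' implementation) of the idea summarized above: unreliable log-spectral components of each frame are imputed from their state-conditional means, with per-frame state posteriors obtained from a forward pass over the frame sequence, so that the reconstruction respects the temporal structure imposed by an HMM prior. All names and parameters here are illustrative; a full system would typically also apply backward smoothing and bound-constrained estimation of the unreliable components.

```python
# Hypothetical sketch of HMM-prior missing-feature reconstruction.
# Each HMM state has a diagonal-covariance Gaussian over log-spectral vectors.
import numpy as np


def reconstruct_hmm(X, mask, means, variances, trans, init):
    """Impute unreliable components of a spectral vector sequence.

    X         : (T, D) observed log-spectral frames
    mask      : (T, D) boolean, True where a component is reliable
    means     : (S, D) state-conditional means
    variances : (S, D) state-conditional (diagonal) variances
    trans     : (S, S) state transition probabilities
    init      : (S,)   initial state probabilities
    """
    T, D = X.shape
    S = means.shape[0]
    X_hat = X.copy()
    alpha = np.zeros((T, S))  # forward (filtering) state posteriors

    for t in range(T):
        r = mask[t]
        # Likelihood of the reliable part of frame t under each state.
        diff = X[t, r] - means[:, r]
        loglik = -0.5 * np.sum(diff ** 2 / variances[:, r]
                               + np.log(2 * np.pi * variances[:, r]), axis=1)
        lik = np.exp(loglik - loglik.max())

        # Temporal prior from the previous frame's posterior and the
        # transition matrix (this is what the i.i.d. cluster model lacks).
        prior = init if t == 0 else alpha[t - 1] @ trans
        alpha[t] = prior * lik
        alpha[t] /= alpha[t].sum()

        # Replace unreliable components by posterior-weighted state means.
        X_hat[t, ~r] = alpha[t] @ means[:, ~r]

    return X_hat
```

Dropping the transition term, i.e., evaluating every frame independently with fixed mixture weights, reduces this sketch to the cluster-based reconstruction that serves as the conventional baseline in the paper.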

