Improving Speech/Music Discrimination Parameter Using Time-Averaged MFCC

Choi, Mu-Yeol;Kim, Hyung-Soon;

MALSORI (대한음성학회지:말소리)

Issue 64
/
Pages.155-169
/
2007
/
1226-1173(pISSN)

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Improving Speech/Music Discrimination Parameter Using Time-Averaged MFCC

MFCC의 단구간 시간 평균을 이용한 음성/음악 판별 파라미터 성능 향상

최무열 (부산대학교 전자공학과 음성통신연구실) ;
김형순 (부산대학교 전자공학과 음성통신연구실)

Published : 2007.12.30

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Discrimination between speech and music is important in many multimedia applications. In our previous work, focusing on the spectral change characteristics of speech and music, we presented a method using the mean of minimum cepstral distances (MMCD), and it showed a very high discrimination performance. In this paper, to further improve the performance, we propose to employ time-averaged MFCC in computing the MMCD. Our experimental results show that the proposed method enhances the discrimination between speech and music. Moreover, the proposed method overcomes the weakness of the conventional MMCD method whose performance is relatively sensitive to the choice of the frame interval to compute the MMCD.

MALSORI (대한음성학회지:말소리)

Improving Speech/Music Discrimination Parameter Using Time-Averaged MFCC

MFCC의 단구간 시간 평균을 이용한 음성/음악 판별 파라미터 성능 향상

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)