Modelling Duration In Text-to-Speech Systems

Chung Hyunsong

MALSORI (Journal of the Korean Society of Phonetic Sciences and Speech Technology)

Issue 49 / Pages 159-174 / 2004 / 1226-1173 (pISSN)

The Korean Society of Phonetic Sciences and Speech Technology

Chung Hyunsong (Daegu University)

Published : 2004.03.01

Abstract

This paper reviews the development of the durational component of prosody modelling in text-to-speech (TTS) conversion of spoken English and Korean, showing the strengths and weaknesses of each approach. It also investigates the possibility of integrating linguistic feature effects into the duration modelling of TTS systems. The paper claims that current approaches to synthesising speech timing still require an understanding of how segmental duration is affected by context. Three modelling approaches are discussed: sequential rule systems, Classification and Regression Tree (CART) models, and Sums-of-Products (SoP) models. The CART and SoP models perform well in predicting segment duration in English, but SoP modelling does not perform as well for spoken Korean.
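To make the contrast between two of the approaches named in the abstract more concrete, the following is a minimal sketch, not the paper's implementation. It assumes that a sums-of-products model predicts a duration as a sum of terms, each a product of per-factor scale values, and that a sequential rule system applies successive percentage adjustments to the compressible part of an inherent duration. The factor names, parameter tables, and all numeric values are hypothetical and chosen only for illustration. CART models are not sketched here.

```python
from math import prod

# Hypothetical parameter tables: every number below is invented for illustration.
# Each product term maps a factor name to a table of scale values per factor level.
SOP_TERMS = [
    # Term 1: intrinsic segment duration (ms) scaled by stress and phrase position.
    {
        "segment": {"a": 90.0, "i": 70.0, "s": 100.0},
        "stress": {"stressed": 1.2, "unstressed": 1.0},
        "position": {"final": 1.4, "nonfinal": 1.0},
    },
    # Term 2: additive lengthening (ms) that depends on phrase position only.
    {
        "position": {"final": 15.0, "nonfinal": 0.0},
    },
]


def sop_duration(features, terms=SOP_TERMS):
    """Sums-of-products sketch: sum over terms of the product of factor scales."""
    return sum(
        prod(table[features[factor]] for factor, table in term.items())
        for term in terms
    )


def rule_duration(inherent_ms, minimum_ms, percents):
    """Sequential-rule sketch: each rule rescales a running percentage, which
    is then applied to the compressible part (inherent minus minimum)."""
    pct = 100.0
    for p in percents:
        pct = pct * p / 100.0
    return minimum_ms + (inherent_ms - minimum_ms) * pct / 100.0


if __name__ == "__main__":
    vowel = {"segment": "a", "stress": "stressed", "position": "final"}
    print(sop_duration(vowel))
    # Two hypothetical rules: lengthen to 140%, then shorten to 80%.
    print(rule_duration(100.0, 40.0, [140.0, 80.0]))
```

In this sketch the rule system applies its adjustments in a fixed order to a single running percentage, whereas the sums-of-products form lets different groups of contextual factors combine multiplicatively within a term and additively across terms.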
