Modeling tone and intonation in Mandarin and English as a process of target approximation
Journal article
Authors/Editors
Strategic Research Themes
No matching items found.
Publication Details
Author list: Prom-On S., Xu Y., Thipakorn B.
Publication year: 2009
Volume number: 125
Issue number: 1
Start page: 405
End page: 424
Number of pages: 20
ISSN: 0001-4966
eISSN: 0001-4966
Languages: English-Great Britain (EN-GB)
View in Web of Science | View on publisher site | View citing articles in Web of Science
Abstract
This paper reports the development of a quantitative target approximation (qTA) model for generating F0 contours of speech. The qTA model simulates the production of tone and intonation as a process of syllable-synchronized sequential target approximation [Xu, Y. (2005). "Speech melody as articulatorily implemented communicative functions," Speech Commun. 46, 220-251]. It adopts a set of biomechanical and linguistic assumptions about the mechanisms of speech production. The communicative functions directly modeled are lexical tone in Mandarin and lexical stress in English and focus in both languages. The qTA model is evaluated by extracting function-specific model parameters from natural speech via supervised learning (automatic analysis by synthesis) and comparing the F0 contours generated with the extracted parameters to those of natural utterances through numerical evaluation and perceptual testing. The F0 contours generated by the qTA model with the learned parameters were very close to the natural contours in terms of root mean square error, rate of human identification of tone, and focus and judgment of naturalness by human listeners. The results demonstrate that the qTA model is both an effective tool for research on tone and intonation and a potentially effective system for automatic synthesis of tone and intonation. ฉ 2009 Acoustical Society of America.
Keywords
No matching items found.