Simulating vocal learning of spoken language: Beyond imitation

Journal article


Authors/Editors


Strategic Research Themes


Publication Details

Author listvan Niekerk, Daniel R.; Xu, Anqi; Gerazov, Branislav; Krug, Paul K.; Birkholz, Peter; Halliday, Lorna;
Prom-on, Santitham; Xu, Yi;

PublisherElsevier

Publication year2023

JournalSpeech Communication (0167-6393)

Volume number147

Start page51

End page62

Number of pages12

ISSN0167-6393

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85147192695&doi=10.1016%2fj.specom.2023.01.003&partnerID=40&md5=b56bad3ccb806cf58e804c94b69fdadf

LanguagesEnglish-Great Britain (EN-GB)


View in Web of Science | View on publisher site | View citing articles in Web of Science


Abstract

Computational approaches have an important role to play in understanding the complex process of speech acquisition, in general, and have recently been popular in studies of vocal learning in particular. In this article we suggest that two significant problems associated with imitative vocal learning of spoken language, the speaker normalisation and phonological correspondence problems, can be addressed by linguistically grounded auditory perception. In particular, we show how the articulation of consonant–vowel syllables may be learnt from auditory percepts that can represent either individual utterances by speakers with different vocal tract characteristics or ideal phonetic realisations. The result is an optimisation-based implementation of vocal exploration – incorporating semantic, auditory, and articulatory signals – that can serve as a basis for simulating vocal learning beyond imitation. © 2023 The Author(s)


Keywords

Computational modelingSpeech ProcessingSpeech Synthesis


Last updated on 2023-17-10 at 07:37