Simulating vocal learning of spoken language: Beyond imitation

Journal article

Authors/Editors

Santitham Prom-on

Strategic Research Themes

Design modelling and optimisation (Computational Science and Engineering)

Publication details

Author list: van Niekerk, Daniel R.; Xu, Anqi; Gerazov, Branislav; Krug, Paul K.; Birkholz, Peter; Halliday, Lorna; Prom-on, Santitham; Xu, Yi

Publisher: Elsevier

Publication year: 2023

Journal: Speech Communication (0167-6393)

Volume number: 147

Start page: 51

End page: 62

Number of pages: 12

ISSN: 0167-6393

URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85147192695&doi=10.1016%2fj.specom.2023.01.003&partnerID=40&md5=b56bad3ccb806cf58e804c94b69fdadf

Language: English-Great Britain (EN-GB)

Abstract

Computational approaches have an important role to play in understanding the complex process of speech acquisition, in general, and have recently been popular in studies of vocal learning in particular. In this article we suggest that two significant problems associated with imitative vocal learning of spoken language, the speaker normalisation and phonological correspondence problems, can be addressed by linguistically grounded auditory perception. In particular, we show how the articulation of consonant–vowel syllables may be learnt from auditory percepts that can represent either individual utterances by speakers with different vocal tract characteristics or ideal phonetic realisations. The result is an optimisation-based implementation of vocal exploration – incorporating semantic, auditory, and articulatory signals – that can serve as a basis for simulating vocal learning beyond imitation. © 2023 The Author(s)

Keywords

Computational modeling, Speech Processing, Speech Synthesis