Reduced complexity tone classifier for automatic tonal speech recognizer

Conference proceedings article


ผู้เขียน/บรรณาธิการ


กลุ่มสาขาการวิจัยเชิงกลยุทธ์

ไม่พบข้อมูลที่เกี่ยวข้อง


รายละเอียดสำหรับงานพิมพ์

รายชื่อผู้แต่งChaiwongsai J., Chiracharit W., Chamnongthai K., Miyanaga Y., Higuchi K.

ผู้เผยแพร่Hindawi

ปีที่เผยแพร่ (ค.ศ.)2012

หน้าแรก82

หน้าสุดท้าย86

จำนวนหน้า5

ISBN9781467311571

นอก0146-9428

eISSN1745-4557

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84872164430&doi=10.1109%2fISCIT.2012.6381017&partnerID=40&md5=2b292eae84e97b906417d1c6dc02ee35

ภาษาEnglish-Great Britain (EN-GB)


ดูบนเว็บไซต์ของสำนักพิมพ์


บทคัดย่อ

A tone classifier is an essential part of an automatic tonal speech recognizer (ATSR) because tonal languages recognize word meaning by tones. However, many researchers have developed a highly efficient tone recognition by using rich mathematical techniques and used the whole input speech as an input of pitch detection process. This paper proposes a reduced complexity tone classifier for the automatic tonal speech recognizer. The classifier reduces the number of input frames by detecting only the vowel signals as an input of the pitch detection, called vowel-AMDF (V-AMDF). The classifier uses a lower number of floating-point operations (FLOPs) than used in the whole input speech method. Due to the reduced number of FLOPs, this tone classifier can be suitable for portable electronic equipment. In addition, V-AMDF reduces F0 contour errors caused by the influence from neighboring syllables. This proposed classifier was tested and set by 19 Thai words, selected from voice activation for GPS system and phone dialing options. The experimental results show 86.0% recognition accuracy, and 21.8% reduction in the number of FLOPs, compared with using the whole input speech. ฉ 2012 IEEE.


คำสำคัญ

Automatic tonal speech recognizer (ATSR)floating-point operations (FLOPs)fundamental frequency (F0)vowel-AMDF (V-AMDF)


อัพเดทล่าสุด 2023-23-09 ถึง 07:36