A low power tone recognition for automatic tonal speech recognizer

Conference proceedings article


Authors/Editors


Strategic Research Themes

No matching items found.


Publication Details

Author listChaiwongsai J., Chiracharit W., Chamnongthai K., Miyanaga Y., Higuchi K.

PublisherInstitute of Electronics, Information and Communication Engineers

Publication year2013

Volume numberE96-A

Issue number6

Start page1403

End page1411

Number of pages9

ISSN0916-8508

eISSN1745-1337

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84878541846&doi=10.1587%2ftransfun.E96.A.1403&partnerID=40&md5=14402f688633c7974c1b38c3ad6477bb

LanguagesEnglish-Great Britain (EN-GB)


View in Web of Science | View on publisher site | View citing articles in Web of Science


Abstract

This paper proposes a low power tone recognition suitable for automatic tonal speech recognizer (ATSR). The tone recognition estimates fundamental frequency (F0) only from vowels by using a new magnitude difference function (MDF), called vowel-MDF. Accordingly, the number of operations is considerably reduced. In order to apply the tone recognition in portable electronic equipment, the tone recognition is designed using parallel and pipeline architecture. Due to the pipeline and parallel computations, the architecture achieves high throughput and consumes low power. In addition, the architecture is able to reduce the number of input frames depending on vowels, making it more adaptable depending on the maximum number of frames. The proposed architecture is evaluated with words selected from voice activation for GPS systems, phone dialing options, and words having the same phoneme but different tones. In comparison with the autocorrelation method, the experimental results show 35.7% reduction in power consumption and 27.1% improvement of tone recognition accuracy (110 words comprising 187 syllables). In comparison with ATSR without the tone recognition, the speech recognition accuracy indicates 25.0% improvement of ATSR with tone recogntion (2,250 training data and 45 testing words). Copyright ฉ 2013 The Institute of Electronics, Information and Communication Engineers.


Keywords

ATSRPipeline and parallel architectureTone recognitionVowel-MDF


Last updated on 2023-29-09 at 07:35