DSpace Repository

Comparison of feature extraction for accent dependent Thai speech recognition system

Show simple item record

dc.contributor.author Tantisatirapong S.
dc.contributor.author Prasoproek C.
dc.contributor.author Phothisonothai M.
dc.date.accessioned 2021-04-05T03:05:35Z
dc.date.available 2021-04-05T03:05:35Z
dc.date.issued 2018
dc.identifier.other 2-s2.0-85057580394
dc.identifier.uri https://ir.swu.ac.th/jspui/handle/123456789/12756
dc.identifier.uri https://www.scopus.com/inward/record.uri?eid=2-s2.0-85057580394&doi=10.1109%2fCCE.2018.8465705&partnerID=40&md5=3527e08420dc09bae02d46e65b952d9c
dc.description.abstract This paper aims to compare the feature extraction methods for accent dependent Thai speech from three regions including central, southern and northeastern regions. We investigate four frequency analysis methods: i.e., Energy Spectral Density (ESD), Power Spectral Density (PSD), Mel-Frequency Cepstral Coefficients (MFCC) and Spectrogram (SPT). Radial basis function kernel based on support vector machine is used as a classifier with 5-fold cross validation. The isolated speech data sets are recorded from 30 male and 30 female participants speaking the 10 Thai digits from 0 to 9. The MFCC-based feature gives better accuracy than ESD, PSD and SPT respectively. For within the same region, the MFCC-based feature provides average accuracy of 94.9% and 99.1% for male and female voices respectively. For the three regions, the MFCC-based feature provides average accuracy of 89.34% and 93.81% for male and female voices, respectively. © 2018 IEEE.
dc.subject Electrostatic devices
dc.subject Extraction
dc.subject Power spectral density
dc.subject Radial basis function networks
dc.subject Spectral density
dc.subject Spectrographs
dc.subject Speech recognition
dc.subject Support vector machines
dc.subject Energy spectral density
dc.subject Feature extraction methods
dc.subject Mel frequency cepstral co-efficient
dc.subject Mel frequency cepstral coefficients (MFCC)
dc.subject Power spectral densities (PSD)
dc.subject Radial basis function kernels
dc.subject Spectrograms
dc.subject Speech recognition systems
dc.subject Feature extraction
dc.title Comparison of feature extraction for accent dependent Thai speech recognition system
dc.type Conference Paper
dc.rights.holder Scopus
dc.identifier.bibliograpycitation 2018 IEEE 7th International Conference on Communications and Electronics, ICCE 2018. (2018), p.322-325
dc.identifier.doi 10.1109/CCE.2018.8465705


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics