Please use this identifier to cite or link to this item: https://ir.swu.ac.th/jspui/handle/123456789/29395
Full metadata record
DC Field | Value | Language
dc.contributor.author | Supmee P. |
dc.contributor.author | Suwanmalai K. |
dc.contributor.author | Hanchoenkul N. |
dc.contributor.author | Sae-Bae N. |
dc.contributor.author | Khomkham B. |
dc.contributor.other | Srinakharinwirot University |
dc.date.accessioned | 2023-11-15T02:08:32Z | -
dc.date.available | 2023-11-15T02:08:32Z | -
dc.date.issued | 2023 |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85169289781&doi=10.1109%2fJCSSE58229.2023.10202157&partnerID=40&md5=9b99f0441915f4efbe245c2f5f87f506 |
dc.identifier.uri | https://ir.swu.ac.th/jspui/handle/123456789/29395 | -
dc.description.abstract | This paper studies how well a speech synthesis model preserves speaker identity when the duration of the Thai speech samples varies. Two experiments were designed to investigate this property. The first experiment evaluated the identity-preserving performance of the identity vector derived from the speech synthesis model. The results suggest that longer Thai speech signals yield better identity vectors, because shorter signals produce identity vectors that are more dispersed. The second experiment directly evaluated the identity-preserving performance of the voice signal generated by the speech synthesis model, using independent speaker recognition systems. The results similarly suggest that longer Thai speech signals yield more identity-preserving voice signals, because shorter signals produce synthesized voices that lie at larger distances from the real voice signals. Therefore, the trade-off between usability and quality of synthesized voices must be carefully considered when developing applications from such models. In addition, the investigation framework used in this study could be used to evaluate newly developed identity-preserving speech synthesis models. © 2023 IEEE. |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. |
dc.subject | Speaker recognition |
dc.subject | Speech synthesis |
dc.subject | Voice quality |
dc.subject | Voice signal |
dc.title | Effects of Speech Duration on Preserving the Identity of Synthesized Voice |
dc.type | Conference paper |
dc.rights.holder | Scopus |
dc.identifier.bibliograpycitation | Proceedings of JCSSE 2023 - 20th International Joint Conference on Computer Science and Software Engineering (2023), p. 242-246 |
dc.identifier.doi | 10.1109/JCSSE58229.2023.10202157 |
Appears in Collections: Scopus 2023

Files in This Item:
There are no files associated with this item.


Items in SWU repository are protected by copyright, with all rights reserved, unless otherwise indicated.