03282nam 22005775 450 991025432770332120200630121347.0981-10-3734-510.1007/978-981-10-3734-4(CKB)3710000001152142(DE-He213)978-981-10-3734-4(MiAaPQ)EBC4838376(PPN)200510940(EXLCZ)99371000000115214220170408d2017 u| 0engurnn|008mamaatxtrdacontentcrdamediacrrdacarrierQuality of Synthetic Speech Perceptual Dimensions, Influencing Factors, and Instrumental Assessment /by Florian Hinterleitner1st ed. 2017.Singapore :Springer Singapore :Imprint: Springer,2017.1 online resource (XVI, 157 p. 29 illus.) T-Labs Series in Telecommunication Services,2192-2810981-10-3733-7 Includes bibliographical references at the end of each chapters.Introduction -- Speech Synthesis -- Auditory and Instrumental Quality Evaluation Metrics -- Perceptual Quality Dimensions -- Influencing Factors on Perceptual Quality -- Instrumental Quality Assessment -- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- Conclusions.This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.T-Labs Series in Telecommunication Services,2192-2810Signal processingImage processingSpeech processing systemsUser interfaces (Computer systems)Signal, Image and Speech Processinghttps://scigraph.springernature.com/ontologies/product-market-codes/T24051User Interfaces and Human Computer Interactionhttps://scigraph.springernature.com/ontologies/product-market-codes/I18067Signal processing.Image processing.Speech processing systems.User interfaces (Computer systems)Signal, Image and Speech Processing.User Interfaces and Human Computer Interaction.006.54Hinterleitner Florianauthttp://id.loc.gov/vocabulary/relators/aut998271MiAaPQMiAaPQMiAaPQBOOK9910254327703321Quality of Synthetic Speech2289756UNINA