000 03174nam a22005535i 4500
001 978-981-10-3734-4
003 DE-He213
005 20220801222141.0
007 cr nn 008mamaa
008 170408s2017 si | s |||| 0|eng d
020 _a9789811037344
_9978-981-10-3734-4
024 7 _a10.1007/978-981-10-3734-4
_2doi
050 4 _aTK5102.9
072 7 _aTJF
_2bicssc
072 7 _aUYS
_2bicssc
072 7 _aTEC008000
_2bisacsh
072 7 _aTJF
_2thema
072 7 _aUYS
_2thema
082 0 4 _a621.382
_223
100 1 _aHinterleitner, Florian.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
_959978
245 1 0 _aQuality of Synthetic Speech
_h[electronic resource] :
_bPerceptual Dimensions, Influencing Factors, and Instrumental Assessment /
_cby Florian Hinterleitner.
250 _a1st ed. 2017.
264 1 _aSingapore :
_bSpringer Nature Singapore :
_bImprint: Springer,
_c2017.
300 _aXVI, 157 p. 29 illus.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aT-Labs Series in Telecommunication Services,
_x2192-2829
505 0 _aIntroduction -- Speech Synthesis -- Auditory and Instrumental Quality Evaluation Metrics -- Perceptual Quality Dimensions -- Influencing Factors on Perceptual Quality -- Instrumental Quality Assessment -- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- Conclusions.
520 _aThis book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.
650 0 _aSignal processing.
_94052
650 0 _aUser interfaces (Computer systems).
_911681
650 0 _aHuman-computer interaction.
_96196
650 1 4 _aSignal, Speech and Image Processing .
_931566
650 2 4 _aUser Interfaces and Human Computer Interaction.
_931632
710 2 _aSpringerLink (Online service)
_959979
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9789811037337
776 0 8 _iPrinted edition:
_z9789811037351
776 0 8 _iPrinted edition:
_z9789811099533
830 0 _aT-Labs Series in Telecommunication Services,
_x2192-2829
_959980
856 4 0 _uhttps://doi.org/10.1007/978-981-10-3734-4
912 _aZDB-2-ENG
912 _aZDB-2-SXE
942 _cEBK
999 _c80453
_d80453