Work
Research Field
quality, speech technology
Research Topics
quality and quality dimensions of synthetic speech
instrumental quality prediction of synthetic speech
Biography
Florian Hinterleitner studied Communication and Computer Science at the Technical University Berlin. In 2010 he completed his Magister Thesis "Signalbased Quality Prediction of Synthetic Speech". He is currently working as a research assistant at the Quality and Usability Lab of Telekom Innovation Laboratories, TU Berlin, in the domain of quality prediction of synthetic speech.
Projects
Teaching
Speech Communication (since winter semester 2010/2011)
Publications
Citation key | hinterleitner2013a |
---|---|
Author | Hinterleitner, Florian and Norrenbrock, Christoph and Möller, Sebastian |
Book title | 8th ISCA Speech Synthesis Workshop |
Pages | 167–171 |
Year | 2013 |
Location | Barcelona, Spain |
Month | August |
Abstract | In this paper, we present a comparative overview of 9 studies on perceptual quality dimensions of synthetic speech. Different subjective assessment techniques have been used to evaluate the text-to-speech (TTS) stimuli in each of these tests: in a semantic differential, the test participants rate every stimulus on a given set of rating scales, while in a paired comparison test, the subjects rate the similarity of pairs of stimuli. Perceptual quality dimensions can be derived from the results of both test methods, either by performing a factor analysis or via multidimensional scaling. We show that even though the 9 tests differ in terms of used synthesizer types, stimulus duration, language, and quality assessment methods, the resulting perceptual quality dimensions can be linked to 5 universal quality dimensions of synthetic speech: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) disturbances, and (v) calmness. |