Using fNIRS to Characterize Human Perception of TTS System Quality, Comprehension, and Fluency: Preliminary Findings
Citation key gupta2013a
Author Gupta, Rishabh and Laghari, K. and Arndt, Sebastian and Schleicher, Robert and Möller, Sebastian and O'Shaughnessy, Douglas and Falk, Tiago H.
Title of Book Proc. 4th International Workshop on Perceptual Quality of Systems (PQS 2013)
Pages 73–78
Year 2013
ISBN 978-0470012352
Workshop workshop
Location Vienna, Austria
Address Vienna, Austria
Month September
Note electronic/online
Editor Schatz, R. and Hoßfeld, Tobias
Publisher FTW
How Published full
Organization FTW
Abstract The quality of synthesized speech signals from different Text-to-Speech (TTS) systems is traditionally evaluated using subjective tests based on user ratings. Subjective testing, however, is challenging due to the variability and complexity of human perception. As such, recently there has been a shift towards exploring new objective techniques to evaluate the quality of TTS systems. In this paper, we describe our initial effort of characterizing human TTS quality perception via neurophysiological insights obtained from a neuroimaging technology called functional Near Infrared Spectroscopy (fNIRS). This approach allowed for a link between the human decision making process and the quality of different TTS systems to be established. We showed significant correlations between perceived quality and several fNIRS features related to cerebral haemodynamics. These preliminary results have helped establish the potential of fNIRS as an important tool for evaluating the quality of TTS systems.
Index Terms: fNIRS, TTS, quality measurement, Quality of Experience (QoE), fluency, comprehension

