Characterization of Human Emotions and Preferences for Text-to-Speech Systems Using Multimodal Neuroimaging Methods
Published in IEEE Canadian Conference on Electrical and Computer Engineering (CCECE 2014), 2014
Rehman Laghari, K., Gupta, R., Arndt, S., Antons, J.-N., Möller, S. & Falk, T. H.
We investigate how listeners’ emotional responses and preferences to text-to-speech (TTS) voices can be quantified using a multimodal approach. Behavioral ratings are combined with EEG- and peripheral-physiology–based features while participants listen to TTS variants differing in prosody and signal quality. Results show reliable links between frontal EEG markers, arousal/valence reports, and user preference, suggesting neurophysiological measures as complementary indicators for TTS evaluation.
Recommended citation: Rehman Laghari, K., Gupta, R., Arndt, S., Antons, J.-N., Möller, S. & Falk, T. H. (2014, May). Characterization of Human Emotions and Preferences for Text-to-Speech Systems Using Multimodal Neuroimaging Methods. Paper presented at the IEEE Canadian Conference on Electrical and Computer Engineering (CCECE 2014), Toronto, ON, Canada. https://doi.org/10.1109/CCECE.2014.6901142
