Publication

Modeling Driver's Speech under Stress

Jan. 1, 2003

People

Rosalind W. Picard

Grover M. Hermann Professor in Health Sciences and Technology; Professor of Media Arts and Sciences

Raul Fernandez

Former Research Assistant

Projects

Emotion Navigation

Raul Fernandez, Rosalind W. Picard

Abstract

We explore the use of features derived from multiresolution analysis of speech and the Teager Energy Operator for classification of drivers' speech under stressed conditions. We apply this set of features to a database of short speech utterances to create user-dependent discriminants of four stress categories. In addition, we address the problem of choosing a suitable temporal scale for representing categorical differences in the data. This leads to two modeling approaches. In the first approach, the dynamics of the feature set within the utterance are assumed to be important for the classification task. These features are then classified using dynamic Bayesian network (DBN) models as well as a model consisting of a mixture of hidden Markov models (M-HMM). In the second approach, we define an utterance-level feature set by taking the mean value of the features across the utterance. This feature set is then modeled with a support vector machine and a multilayer perceptron classifier. We compare the performance on the sparser and full dynamic representations against a chance-level performance of 25% and obtain the best performance with the speaker-dependent mixture model (96.4% on the training set, and 61.2% on a separate testing set). We also investigate how these models perform on the speaker-independent task. Although the performance of the speaker-independent models degrades with respect to the models trained on individual speakers, the mixture model still outperforms the competing models and achieves significantly better than random recognition (80.4% on the training set, and 51.2% on a separate testing set).
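
As a rough illustration of the two temporal scales described in the abstract, the sketch below computes the standard discrete Teager Energy Operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], on short frames (a frame-level sequence, as in the dynamic approach) and then averages the frames into a single utterance-level vector (as in the second approach). It is a minimal sketch under stated assumptions, not the authors' pipeline: the multiresolution subband decomposition and the classifiers are omitted, and the function names, frame length, hop size, and synthetic test tone are all hypothetical choices made for illustration.

```python
import math


def teager_energy(x):
    """Standard discrete Teager Energy Operator: x[n]^2 - x[n-1]*x[n+1].

    The output is two samples shorter than the input because the
    operator needs one neighbour on each side.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def frame_teo_features(signal, frame_len, hop):
    """Frame-level sequence: mean Teager energy of each frame.

    Each frame yields a one-dimensional feature vector here; a fuller
    system would compute one value per subband (assumption).
    """
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        teo = teager_energy(signal[start:start + frame_len])
        feats.append([sum(teo) / len(teo)])
    return feats


def utterance_mean(frames):
    """Utterance-level vector: mean of the frame features across the utterance."""
    if not frames:
        raise ValueError("need at least one frame")
    dim = len(frames[0])
    return [sum(f[i] for f in frames) / len(frames) for i in range(dim)]


# Synthetic stand-in for a speech signal (hypothetical values).
amp, omega = 0.5, 0.3
signal = [amp * math.cos(omega * n) for n in range(400)]

frames = frame_teo_features(signal, frame_len=100, hop=50)  # dynamic representation
utt = utterance_mean(frames)                                # sparser representation
```

For a pure tone of amplitude A and digital frequency omega, the operator returns the constant A^2 * sin(omega)^2, which makes the sketch easy to sanity-check. The frame sequence would feed a sequence model such as an HMM, while the averaged vector would feed a static classifier such as an SVM.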

03.fernandez-picard.pdf

Detecting Stress During Real-World Driving Tasks Using Physiological Sensors

A Computational Model for the Automatic Recognition of Affect in Speech

Signal Processing for Recognition of Human Frustration

Expression Glasses: A Wearable Device for Facial Expression Recognition
