Transfer Learning with Real-World Nonverbal Vocalizations from Minimally Speaking Individuals

Narain, J., Johnson, K., Quatieri, T., Picard, R., and Maes, P. “Transfer Learning with Real-World Vocalizations from Minimally Speaking Individuals”. Workshop on Interpretable ML in Healthcare at the International Conference on Machine Learning. July 2021.


We trained and evaluated several transfer learning approaches to classify the affect and communicative intent of nonverbal vocalizations from eight minimally speaking individuals (mv*). The datasets were recorded in real-world settings with in-the-moment labels provided by a close family member. We trained deep neural networks (DNNs) on six audio datasets (including our dataset of nonverbal vocalizations) and then fine-tuned the models to classify affect and intent for each individual. We also evaluated a zero-shot approach to arousal and valence regression using an acted dataset of nonverbal vocalizations that occur amidst typical speech. For two of the eight mv* communicators, fine-tuning improved model performance over fully personalized DNNs, and the arousal values inferred via zero-shot learning showed weak groupings. The limited success of the evaluated transfer learning approaches highlights the need for specialized datasets collected with mv* individuals.
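The fine-tuning step described above — pretraining a DNN on larger audio datasets, then adapting it per individual — can be sketched as follows. This is a minimal illustrative example, not the authors' actual pipeline: the network sizes, feature dimension (40, e.g. a mel-spectrogram summary), and label count (5 affect/intent classes) are assumptions, and the "pretrained" extractor is a stand-in.

```python
# Hedged sketch of per-individual fine-tuning (assumed architecture, not the paper's).
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-in for a feature extractor pretrained on large audio datasets.
feature_extractor = nn.Sequential(
    nn.Linear(40, 32), nn.ReLU(),
    nn.Linear(32, 16), nn.ReLU(),
)
# New classification head for one mv* communicator's label set (assumed 5 classes).
head = nn.Linear(16, 5)

# Freeze the pretrained layers; only the new head is updated during fine-tuning.
for p in feature_extractor.parameters():
    p.requires_grad = False

model = nn.Sequential(feature_extractor, head)
optimizer = torch.optim.Adam(head.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

# Toy personalized data: 64 vocalization feature vectors with caregiver labels.
x = torch.randn(64, 40)
y = torch.randint(0, 5, (64,))

frozen_before = [p.clone() for p in feature_extractor.parameters()]
initial_loss = loss_fn(model(x), y).item()

for _ in range(20):  # a few fine-tuning steps on the individual's data
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()

final_loss = loss_fn(model(x), y).item()
# The frozen extractor is unchanged; only the head adapted to this individual.
assert all(torch.equal(a, b)
           for a, b in zip(frozen_before, feature_extractor.parameters()))
```

Freezing the shared extractor and training only a small head is one common way to fine-tune with the very limited per-person data available for mv* communicators; the paper's models may instead update all layers.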