
Mental Imagery for a Conversational Robot

Deb Roy, Kai-Yuh Hsiao, Nikolaos Mavridis

Abstract

To engage in fluid face-to-face spoken conversations with people, robots must have ways to connect what they say to what they see. A critical aspect of how language connects to vision is that language encodes points of view: the meaning of "my left" and "your left" differs due to an implied shift of visual perspective. The connection of language to vision also relies on object permanence, since we can talk about things that are not in view. For a robot to participate in situated spoken dialog, it must have the capacity to imagine shifts of perspective, and it must maintain object permanence. We present a set of representations and procedures that enable a robotic manipulator to maintain a "mental model" of its physical environment by coupling active vision to physical simulation. Within this model, "imagined" views can be generated from arbitrary perspectives, providing the basis for situated language comprehension and production. An initial application of mental imagery to spatial language understanding for an interactive robot is described.
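To make the idea of perspective shifts and object permanence concrete, the following is a minimal sketch, not the authors' implementation: a hypothetical MentalModel class that remembers last-known object positions (so objects persist when out of view) and re-expresses them in an imagined observer's reference frame, which is enough to resolve a perspective-dependent phrase such as "your left". All names, coordinates, and conventions here are illustrative assumptions.

```python
import numpy as np

class MentalModel:
    """Hypothetical sketch of a mental model: remembered object poses plus
    imagined views from arbitrary observer perspectives."""

    def __init__(self):
        # Last-known object positions in the robot's base frame; entries stay
        # in the model even when the objects leave the camera's field of view
        # (object permanence).
        self.objects = {}

    def observe(self, name, position_robot_frame):
        # Update the remembered position of an object from a new observation.
        self.objects[name] = np.asarray(position_robot_frame, dtype=float)

    def view_from(self, observer_position, observer_heading_rad):
        """Re-express all remembered objects in an imagined observer's frame
        (x = forward along the observer's heading, y = observer's left)."""
        c, s = np.cos(observer_heading_rad), np.sin(observer_heading_rad)
        # Rotation taking robot-frame displacements into the observer's frame.
        rot = np.array([[c, s], [-s, c]])
        origin = np.asarray(observer_position, dtype=float)
        return {name: rot @ (pos[:2] - origin[:2])
                for name, pos in self.objects.items()}


def resolve_side(imagined_view, name):
    """Return 'left' or 'right' relative to the imagined observer."""
    return "left" if imagined_view[name][1] > 0 else "right"


# Usage: the cup was seen earlier; even if it is now occluded, the model can
# still answer "is the cup on your left?" from the person's imagined viewpoint.
model = MentalModel()
model.observe("cup", [0.6, -0.3, 0.0])                 # robot-frame coordinates (m)
person_view = model.view_from([1.2, 0.0, 0.0], np.pi)  # person stands facing the robot
print(resolve_side(person_view, "cup"))                # -> 'left' from the person's view
```

The design choice illustrated here is that a single stored world model can serve many viewpoints: "my left" and "your left" differ only in which observer pose is passed to the imagined-view computation, not in what the model stores.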
