Publication

Unsupervised Content-Based Indexing of Sports Video Retrieval

Sept. 28, 2007

People

Share this publication

Michael Fleischman, Deb Roy

Abstract

This paper presents a methodology for automatically indexing a large corpus of broadcast baseball games using an unsupervised content-based approach. The method relies on the learning of a grounded language model which maps query terms to the nonlinguistic context to which they refer. Grounded language models are learned from a large, unlabeled corpus of video events. Events are represented using a codebook of automatically discovered temporal patterns of low level features extracted from the raw video. These patterns are associated with words extracted from the closed captioning text using a generalization of Latent Dirichlet Allocation. We evaluate the benefit of the grounded language model by extending a traditional language model based approach to information retrieval. Experimental results indicate that using a grounded language model nearly doubles performance on a held out test set.

mir09-fleischman.pdf

Unsupervised Content-Based Indexing of Sports Video Retrieval

People

Abstract

Intentional Context in Situated Language Learning

Why are verbs harder to learner than nouns? Initial insights from a computational model of situated word learning

Situated Models of Meaning for Sports Video Retrieval

Grounded Language Modeling for Automatic Speech Recognition of Sports Video

Unsupervised Content-Based Indexing of Sports Video Retrieval

People

Share this publication

Abstract

Intentional Context in Situated Language Learning

Why are verbs harder to learner than nouns? Initial insights from a computational model of situated word learning

Situated Models of Meaning for Sports Video Retrieval

Grounded Language Modeling for Automatic Speech Recognition of Sports Video