Handbook of Video Databases: Design and Applications (Internet and Communications)

Ahmet Ekin and A. Murat Tekalp Department of Electrical and Computer EngineeringUniversity of Rochester, Rochester, NY, 14627 USA {ekin,tekalp}@ece.rochester.edu

1. Introduction

In the last decade, several technological developments enabled the creation, compression, and storage of large amounts of digital information, such as video, images, and audio. With increasing usage of Internet and wireless communications, ubiquitous consumption of such information poses several problems. We address two of those: efficient content-based indexing (description) of video information for fast search, and summarization of video in order to enable delivery of the essence of the content over low-bitrate channels. The description and summarization problems have also been the focus of recently completed ISO MPEG-7 standard [1], formally Multimedia Content Description Interface, which standardizes a set of descriptors and description schemes. However, MPEG-7 does not normatively specify media processing techniques to extract these descriptors or to summarize the content, which are the main goals of this chapter. We also propose a generic integrated semantic-syntactic event model for search applications, which we believe is more powerful than the MPEG-7 Semantic Description Scheme.

Although the proposed semantic-syntactic event model is generic, its automatic instantiation for generic video and automatic generation of summaries that capture the essence for generic video are not simple. Hence, for video analysis, we focus on sports video; in particular soccer video, since sports video appeals to large audiences and its distribution over various networks should contribute to quick adoption and widespread usage of multimedia services worldwide. The model should provide a mapping between the low-level video features and high-level events and objects, which can be met for certain objects and events in the domain of sports video, and the processing for summary generation should be automatic, and in real, or near real-time.

Sports video is consumed in various scenarios, where network bandwidth and processing time determine the depth of the analysis and description. To this effect, we classify four types of users:

The next section briefly explains related works in video modeling, description, and analysis. We present a generic semantic-syntactic model to describe sports video for search applications in Section 3. In Section 4, we introduce a sports video analysis framework for video summarization and instantiation of the proposed model. The summaries can be low-level summaries computed on the server (mobile user) or customized summaries computed at the user side (TV user). The model instantiation enables search applications (web user or professional user). Finally, in Section 5, we describe a graph-based querying framework for video search and browsing with the help of the proposed model.

Категории