Video Content Analysis Using Multimodal Information: For Movie Content Extraction, Indexing and Representation

前表紙
Springer Science & Business Media, 2003/06/30 - 194 ページ
With the fast growth ofmultimedia information, content-based video anal- ysis, indexing and representation have attracted increasing attention in re- cent years. Many applications have emerged in these areas such as video- on-demand, distributed multimedia systems, digital video libraries, distance learning/education, entertainment, surveillance and geographical information systems. The need for content-based video indexing and retrieval was also rec- ognized by ISOIMPEG, and a new international standard called "Multimedia Content Description Interface" (or in short, MPEG-7)was initialized in 1998 and finalized in September 2001. In this context, a systematic and thorough review ofexisting approaches as well as the state-of-the-art techniques in video content analysis, indexing and representation areas are investigated and studied in this book. In addition, we will specifically elaborate on a system which analyzes, indexes and abstracts movie contents based on the integration ofmultiple media modalities. Content ofeach part ofthis book is briefly previewed below. In the first part, we segment a video sequence into a set ofcascaded shots, where a shot consistsofone or more continuouslyrecorded image frames. Both raw and compressedvideo data will beinvestigated. Moreover, consideringthat there are always non-story units in real TV programs such as commercials, a novel commercial break detection/extraction scheme is developed which ex- ploits both audio and visual cues to achieve robust results. Specifically, we first employ visual cues such as the video data statistics, the camera cut fre- quency, and the existenceofdelimiting black frames between commercials and programs, to obtain coarse-level detection results.
 

目次

INTRODUCTION
1
1 Audiovisual Content Analysis
2
2 Video Indexing Browsing and Abstraction
3
3 MPEG7 Standard
4
4 Roadmap of The Book
6
BACKGROUND AND PREVIOUS WORK
11
2 Audio Content Analysis
18
3 Speaker Identification
20
SPEAKER IDENTIFICATION FOR MOVIES
97
1 Supervised Speaker Identification for Movie Dialogs
98
2 Adaptive Speaker Identification
105
3 Experimental Results
117
SCENEBASED MOVIE SUMMARIZATION
133
1 An Overview of the Proposed System
134
3 Scalable Movie Summarization and Navigation
143
4 Experimental Results
145

4 Video Abstraction
22
5 Video Indexing and Retrieval
32
VIDEO CONTENT PREPROCESSING
35
1 Shot Detection in Raw Data Domain
36
2 Shot Detection in Compressed Domain
47
3 Audio Feature Analysis
50
4 Commercial Break Detection
55
5 Experimental Results
63
CONTENTBASED MOVIE SCENE AND EVENT EXTRACTION
69
1 Movie Scene Extraction
70
2 Movie Event Extraction
84
3 Experimental Results
91
EVENTBASED MOVIE SKIMMING
153
2 An Overview of the Proposed System
155
4 Extended Event Feature Extraction
157
5 Video Skim Generation
159
6 More Thoughts on the Video Skim
160
7 Experimental Results
165
CONCLUSION AND FUTURE WORK
169
2 Future Work
171
References
179
Index
193
著作権

他の版 - すべて表示

多く使われている語句

人気のある引用

189 ページ - Video skimming and characterization through the combination of image and language understanding techniques," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 1997, pp.
188 ページ - Robust TextIndependent Speaker Identification Using Gaussian Mixture Speaker Models", IEEE Transactions on Speech and Audio Processing, vol.
183 ページ - An Integrated Scheme for Automated Video Abstraction Based on Unsupervised Cluster-Validity Analysis", IEEE Trans, on Circuits and Systems for Video Technology, vol.9, December 1999.
184 ページ - A Layered Video Object Coding System Using Sprite and Affine Motion Model", IEEE Trans in Circuits and Systems for Video Technology, Vol7, Nol,1997.
190 ページ - Content-based video parsing and indexing based on audio-visual interaction," IEEE Transactions on Circuits and Systems for Video Technology, vol. 11, no. 4, pp. 522-535, 2001. [5] D. Li, G. Wei, IK Sethi, and N. Dimitrova, "Person identification in TV programs," Journal of Electronic Imaging, vol.

書誌情報