Paper
17 January 2006 Video shot retrieval using a kernel derived from a continuous HMM
Author Affiliations +
Proceedings Volume 6073, Multimedia Content Analysis, Management, and Retrieval 2006; 607311 (2006) https://doi.org/10.1117/12.650968
Event: Electronic Imaging 2006, 2006, San Jose, California, United States
Abstract
In this paper, we propose a discriminative approach for retrieval of video shots characterized by a sequential structure. The task of retrieving shots similar in content to a few positive example shots is more close to a binary classification problem. Hence, this task can be solved by a discriminative learning approach. For a content-based retrieval task the twin characteristics of rare positive example occurrence and a sequential structure in the positive examples make it attractive for us to use a learning approach based on a generative model like HMM. To make use of the positive aspects of both discriminative and generative models, we derive Fisher and Modified score kernels for a Continuous HMM and incorporate them into SVM classification framework. The training set video shots are used to learn SVM classifier. A test set video shot is ranked based on its proximity to the positive class side of hyperplane. We evaluate the performance of the derived kernels by retrieving video shots of airplane takeoff. The retrieval performance using the derived kernels is found to be much better compared to linear and RBF kernels.
© (2006) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Atulya Velivelli, Thomas S. Huang, and Alexander Hauptmann "Video shot retrieval using a kernel derived from a continuous HMM", Proc. SPIE 6073, Multimedia Content Analysis, Management, and Retrieval 2006, 607311 (17 January 2006); https://doi.org/10.1117/12.650968
Lens.org Logo
CITATIONS
Cited by 5 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Motion models

Semantic video

Optical flow

Analytical research

Feature extraction

Classification systems

Back to Top