Recently, research on and applications of human fixation detection in video compressed domain have gained increasing attention. However, prediction accuracy and computational complexity still remain a challenge. This paper addresses the problem of compressed domain fixations detection in the videos based on residual discrete cosine transform coefficients norm (RDCN) and Markov random field (MRF). RDCN feature is directly extracted from the compressed video with partial decoding and is normalized. After spatial–temporal filtering, the normalized map [Smoothed RDCN (SRDCN) map] is taken to the MRF model, and the optimal binary label map is obtained. Based on the label map and the center saliency map, saliency enhancement and nonsaliency inhibition are done for the SRDCN map, and the final SRDCN-MRF salient map is obtained. Compared with the similar models, we enhance the available energy functions and introduce an energy function that indicates the positional information of the saliency. The procedure is advantageous for improving prediction accuracy and reducing computational complexity. The validation and comparison are made by several accuracy metrics on two ground truth datasets. Experimental results show that the proposed saliency detection model achieves superior performances over several state-of-the-art compressed-domain and pixel-domain algorithms on evaluation metrics. Computationally, our algorithm reduces 26% more computational complexity with comparison to similar algorithms.
Although hyperspectral images (HSIs) captured by satellites provide much information in spectral regions, some bands are redundant or have large amounts of noise, which are not suitable for image analysis. To address this problem, we introduce a method for reconstructing the HSI with noise reduction and contrast enhancement using a matting model for the first time. The matting model refers to each spectral band of an HSI that can be decomposed into three components, i.e., alpha channel, spectral foreground, and spectral background. First, one spectral band of an HSI with more refined information than most other bands is selected, and is referred to as an alpha channel of the HSI to estimate the hyperspectral foreground and hyperspectral background. Finally, a combination operation is applied to reconstruct the HSI. In addition, the support vector machine (SVM) classifier and three sparsity-based classifiers, i.e., orthogonal matching pursuit (OMP), simultaneous OMP, and OMP based on first-order neighborhood system weighted classifiers, are utilized on the reconstructed HSI and the original HSI to verify the effectiveness of the proposed method. Specifically, using the reconstructed HSI, the average accuracy of the SVM classifier can be improved by as much as 19%.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.