The ubiquity of the Internet has brought about an increasing amount of multi-formatted Web documents. Although image occupies a large part of importance on these increasing Web documents, there have not been many researches for analyzing and understanding it. Many Web images are used for carrying important information but others are not used for it. If images in a Web document can be classified by which have particular information or not, then it would be very useful for analysis and multi-formatting of Web documents. In this paper we introduce the machine learning based methods of classifying Web images as either eliminable or non-eliminable. For this research, we have detected 16 special and rich features for Web images and experimented by using the Bayesian and decision tree methods. As the results, F-measures of 87.09%, 82.72% were achieved for each method and particularly, from the experiments to compare the effects of feature groups, it has proved that the selected features on this study are very useful for Web image classification.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.