Paper
23 February 2005 Using Kohonen networks for WWW document classification
Filip Rudzinski, Adam Gluszek, Michal Kekez
Author Affiliations +
Abstract
This paper presents flexible solutions to the clustering and classification of World Wide Web documents. The method proposed in this paper applies the self-organizing Kohonen network known also a self-organizing map (SOM) with two-layer architecture. In this architecture documents become mapped as points on the SOM, in a geometric order that describes the similarity of their contents. This network has been learned by means of unsupervised training technique. After learning process has been completed, the network visualizes semantic relationship between input documents as two-dimensional semantic map. This map is a retrieval interface for an online WWW documents classification system. In this paper, first, the main idea of solution based on SOM has been presented. Next, the operation of this method has been illustrated with the us of synthetic data set. Finally, this technique has been tested by means of real-life WWW documents set.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Filip Rudzinski, Adam Gluszek, and Michal Kekez "Using Kohonen networks for WWW document classification", Proc. SPIE 5775, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments III, (23 February 2005); https://doi.org/10.1117/12.610756
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neurons

Classification systems

Brain mapping

Associative arrays

Visualization

Human-machine interfaces

Network architectures

Back to Top