Paper
12 March 2002 Using data mining to minimize database reverse engineering constraints
Aziz Barbar, Martine Collard
Author Affiliations +
Abstract
In this paper we propose to use data mining techniques for database reverse engineering process. A crucial problem in this process concerns the discovery of similarity between attributes before constructing the conceptual model. The essence of our approach is to mine user queries collected on the database in order to extract specific similarity measure that we call distance between 2 attributes. Indeed most database reverse engineering methods are based on the observation of several sources which generally are the existing database schema, the data themselves and application programs including queries. Unlike previous propositions which analyze only the structure of joins in queries, the main idea of this paper is to exploit the large volume of information stored in queries in order to extract some semantic properties on attributes. Thus we propose to apply a data mining algorithm on a query base collected on the data. The objective is to extract semantic links that do not appear obviously in the schema or in the data and are suggested implicitly by expert users in their queries. In this paper, we focus mainly on the problem of attribute similarity which is quite important in database reverse engineering. We describe a method by which similarities between attributes are discovered according to context measures without taking into consideration the naming policy used by database designers.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Aziz Barbar and Martine Collard "Using data mining to minimize database reverse engineering constraints", Proc. SPIE 4730, Data Mining and Knowledge Discovery: Theory, Tools, and Technology IV, (12 March 2002); https://doi.org/10.1117/12.460226
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Databases

Reverse engineering

Data mining

Process engineering

Nomenclature

Distance measurement

Reverse modeling

RELATED CONTENT

A topological-based spatial data clustering
Proceedings of SPIE (April 20 2016)
Data modeling for data mining
Proceedings of SPIE (March 12 2002)
Model construction with key identification
Proceedings of SPIE (February 25 1999)
Feature transformations and structure of attributes
Proceedings of SPIE (March 12 2002)
Interactive mining of schema for semistructured data
Proceedings of SPIE (March 12 2002)

Back to Top