A perspective on databases and data mining
We discuss the use of database methods for data mining. Recently impressive results have been achieved for some data mining problems using highly specialized and clever data structures. We study how well one can manage by using general purpose database management systems. We illustrate our ideas by investigating the use of a dbms for a well-researched area: the discovery of association rules. We present a simple algorithm, consisting of only union and intersection operations, and show that it achieves quite good performance on an efficient dbms. Our method can incorporate inheritance hierarchies to the association rule algorithm easily. We also present a technique that effectively reduces the number of database operations when searching large search spaces that contain only few interesting items. Our work shows that database techniques are promising for data mining: general architectures can achieve reasonable results.
|Systems (acm H.2.4), Information Search and Retrieval (acm H.3.3), Learning (acm I.2.6)|
|Department of Computer Science [CS]|
Holsheimer, M, Kersten, M.L, Mannila, H, & Toivonen, H. (1995). A perspective on databases and data mining. Department of Computer Science [CS]. CWI.