4.7 Article

Mining globally interesting patterns from multiple databases using kernel estimation

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 36, 期 8, 页码 10863-10869

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2009.01.030

关键词

Multiple database mining; Global pattern; Multiple data source discovery

资金

  1. Australian Research Council (ARC) [DP0985456]
  2. National Natural Science Foundation of China [60496327]
  3. Research Fund for Overseas Scholars [60828005]
  4. China 973 Program [2008CB317108]
  5. China Ministry of Personnel for Overseas-Return High-level Talents
  6. Guangxi NSF
  7. [90718020]
  8. [60625204]

向作者/读者索取更多资源

When extracting knowledge (or patterns) from multiple databases, the data from different databases might be too large in volume to be merged into one database for centralized mining on one computer, the local information sources might be hidden from a global decision maker due to privacy concerns, and different local databases may have different contribution to the global pattern. Dealing with multiple databases is essentially different from mining from a single database. In multi-database mining, the global patterns must be obtained by carefully analyzing the local patterns from individual databases. In this paper, we propose a nonlinear method, named KEMGP (kernel estimation for mining global patterns), to tackle this problem, which adopts kernel estimation to synthesizing local patterns for global patterns. We also adopt a method to divide all the data in different databases according to attribute dimensionality, which reduces the total space complexity. We test our algorithm on a customer management system, where the application is to obtain all globally interesting patterns by analyzing the individual databases. The experimental results show that our method is efficient. (C) 2009 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据