4.5 Article

MOSubdue: a Pareto dominance-based multiobjective Subdue algorithm for frequent subgraph mining

Journal

KNOWLEDGE AND INFORMATION SYSTEMS
Volume 34, Issue 1, Pages 75-108

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s10115-011-0452-y

Keywords

Graph-based data mining; Frequent subgraph mining; Subdue; Gaston; Multiobjective graph-based data mining; Pareto-based multiobjective optimization; Evolutionary multiobjective optimization

Funding

  1. Spanish Ministry of Science and Innovation (MICINN) [TIN2009-07727]
  2. EDRF
  3. MICINN [JCI-2010-07626]

Ask authors/readers for more resources

Graph-based data mining approaches have been mainly proposed to the task popularly known as frequent subgraph mining subject to a single user preference, like frequency, size, etc. In this work, we propose to deal with the frequent subgraph mining problem from multiobjective optimization viewpoint, where a subgraph (or solution) is defined by several user-defined preferences (or objectives), which are conflicting in nature. For example, mined subgraphs with high frequency are often of small size, and vice-versa. Use of such objectives in the multiobjective subgraph mining process generates Pareto-optimal subgraphs, where no subgraph is better than another subgraph in all objectives. We have applied a Pareto dominance approach for the evaluation and search subgraphs regarding to both proximity and diversity in multiobjective sense, which has incorporated in the framework of Subdue algorithm for subgraph mining. The method is called multiobjective subgraph mining by Subdue (MOSubdue) and has several advantages: (i) generation of Pareto-optimal subgraphs in a single run (ii) selection of subgraph-seeds from the candidate subgraphs based on all objectives (iii) search in the multiobjective subgraphs lattice space, and (iv) capability to deal with different multiobjective frequent subgraph mining tasks by customizing the tackled objectives. The good performance of MOSubdue is shown by performing multiobjective subgraph mining defined by two and three objectives on two real-life datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available