Journal
INTEGRATED COMPUTER-AIDED ENGINEERING
Volume 22, Issue 1, Pages 21-39Publisher
IOS PRESS
DOI: 10.3233/ICA-140479
Keywords
Data mining; genetic algorithms; multi-objective optimization; quantitative association rules; large scale datasets
Categories
Funding
- Spanish Ministry of Science and Technology
- University Pablo de Olavide [TIN2011-28956-C02-02, TIC-7528, P12-TIC-1728, APPB813097]
- Junta de Andalucia
Ask authors/readers for more resources
Association rule mining is a well-known methodology to discover significant and apparently hidden relations among attributes in a subspace of instances from datasets. Genetic algorithms have been extensively used to find interesting association rules. However, the rule-matching task of such techniques usually requires high computational and memory requirements. The use of efficient computational techniques has become a task of the utmost importance due to the high volume of generated data nowadays. Hence, this paper aims at improving the scalability of quantitative association rule mining techniques based on genetic algorithms to handle large-scale datasets without quality loss in the results obtained. For this purpose, a new representation of the individuals, new genetic operators and a windowing-based learning scheme are proposed to achieve successfully such challenging task. Specifically, the proposed techniques are integrated into the multi-objective evolutionary algorithm named QARGA-M to assess their performances. Both the standard version and the enhanced one of QARGA-M have been tested in several datasets that present different number of attributes and instances. Furthermore, the proposed methodologies have been integrated into other existing techniques based in genetic algorithms to discover quantitative association rules. The comparative analysis performed shows significant improvements of QARGA-M and other existing genetic algorithms in terms of computational costs without losing quality in the results when the proposed techniques are applied.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available