Article

An evolutionary algorithm for automated machine learning focusing on classifier ensembles: An improved algorithm and extended results

Journal

THEORETICAL COMPUTER SCIENCE
Volume 805, Pages 1-18

Publisher

ELSEVIER
DOI: 10.1016/j.tcs.2019.12.002

Keywords

Automated Machine Learning (Auto-ML); Classification; Evolutionary algorithms; Estimation of distribution algorithms

Funding

  1. Brazilian Research Council (CNPq) [150748/2017-5, 407518/2018-5]

A large number of classification algorithms have been proposed in the machine learning literature. These algorithms have different pros and cons, and no algorithm is the best for all datasets. Hence, a challenging problem consists of choosing the best classification algorithm with its best hyper-parameter settings for a given input dataset. In the last few years, Automated Machine Learning (Auto-ML) has emerged as a promising approach for tackling this problem, by doing a heuristic search in a large space of candidate classification algorithms and their hyper-parameter settings. In this work we propose an improved version of our previous Evolutionary Algorithm (EA) - more precisely, an Estimation of Distribution Algorithm - for the Auto-ML task of automatically selecting the best classifier ensemble and its best hyper-parameter settings for an input dataset. The new version of this EA was compared against its previous version, as well as against a random forest algorithm (a strong ensemble algorithm) and a version of the well-known Auto-ML method Auto-WEKA adapted to search in the same space of classifier ensembles as the proposed EA. In general, in experiments with 21 datasets, the new EA version obtained the best results among all methods in terms of four popular predictive accuracy measures: error rate, precision, recall and F-measure. (C) 2019 Elsevier B.V. All rights reserved.
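To make the idea of an Estimation of Distribution Algorithm searching a space of ensemble configurations more concrete, here is a minimal sketch. It is not the algorithm from the paper: the search space (`SEARCH_SPACE`), the stand-in scoring function (`toy_fitness`), and all parameter values are hypothetical, and the probability update is a simple univariate, PBIL-style rule chosen only for illustration. In a real Auto-ML setting the fitness would come from training and validating each candidate ensemble on the input dataset.

```python
import random

# Hypothetical search space: each decision variable has a few categorical options.
SEARCH_SPACE = {
    "ensemble": ["bagging", "boosting", "voting"],
    "base_learner": ["tree", "naive_bayes", "knn"],
    "n_members": [5, 10, 20],
}


def toy_fitness(candidate):
    """Stand-in for predictive accuracy (assumption: no real model is trained)."""
    score = {"bagging": 0.2, "boosting": 0.3, "voting": 0.1}[candidate["ensemble"]]
    score += {"tree": 0.3, "naive_bayes": 0.1, "knn": 0.2}[candidate["base_learner"]]
    score += {5: 0.1, 10: 0.2, 20: 0.3}[candidate["n_members"]]
    return score


def sample(probs, rng):
    """Draw one candidate by sampling each variable from its own distribution."""
    return {
        var: rng.choices(options, weights=probs[var])[0]
        for var, options in SEARCH_SPACE.items()
    }


def run_eda(generations=30, pop_size=40, elite=10, learning_rate=0.3, seed=0):
    rng = random.Random(seed)
    # Start from a uniform distribution over the options of every variable.
    probs = {
        var: [1.0 / len(options)] * len(options)
        for var, options in SEARCH_SPACE.items()
    }
    best = None
    for _ in range(generations):
        # Sample a population from the current probabilistic model.
        population = [sample(probs, rng) for _ in range(pop_size)]
        population.sort(key=toy_fitness, reverse=True)
        if best is None or toy_fitness(population[0]) > toy_fitness(best):
            best = population[0]
        # Re-estimate the model from the top candidates (truncation selection),
        # blending old probabilities with the observed frequencies.
        selected = population[:elite]
        for var, options in SEARCH_SPACE.items():
            for i, option in enumerate(options):
                freq = sum(1 for c in selected if c[var] == option) / elite
                probs[var][i] = (1 - learning_rate) * probs[var][i] + learning_rate * freq
    return best, probs


if __name__ == "__main__":
    best_candidate, final_probs = run_eda()
    print(best_candidate, round(toy_fitness(best_candidate), 2))
```

The key difference from a conventional evolutionary algorithm is that no crossover or mutation is applied: new candidates are drawn from an explicit probability model that is re-estimated from the better candidates in each generation.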
