4.7 Article

Multi-objective Grammatical Evolution of Decision Trees for Mobile Marketing user conversion prediction

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 168, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2020.114287

Keywords

Conversion Rate (CVR) prediction; Decision Trees; Explainable Artificial Intelligence (XAI); Grammatical Evolution; Lamarckian Evolution

Funding

  1. Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF) [NORTE-01-0247-FEDER017497]
  2. FCT -Fundacao para a Ciencia e Tecnologia, Portugal [UID/CEC/00319/2019]

Ask authors/readers for more resources

The increasing value of Mobile Performance Marketing due to the widespread adoption of mobile devices has led to the development of a data-driven model for predicting user conversion. A novel Multi-objective Optimization approach using Decision Trees evolved through Grammatical Evolution has shown competitive results in handling big data from real-world Demand-Side Platforms, with affordable training times and fast predictive response times.
The worldwide adoption of mobile devices is raising the value of Mobile Performance Marketing, which is supported by Demand-Side Platforms (DSP) that match mobile users to advertisements. In these markets, monetary compensation only occurs when there is a user conversion. Thus, a key DSP issue is the design of a data-driven model to predict user conversion. To handle this nontrivial task, we propose a novel Multi-objective Optimization (MO) approach to evolve Decision Trees (DT) using a Grammatical Evolution (GE), under two main variants: a pure GE method (MGEDT) and a GE with Lamarckian Evolution (MGEDTL). Both variants evolve variable-length DTs and perform a simultaneous optimization of the predictive performance and model complexity. To handle big data, the GE methods include a training sampling and parallelism evaluation mechanism. The algorithms were applied to a recent database with around 6 million records from a real-world DSP. Using a realistic Rolling Window (RW) validation, the two GE variants were compared with a standard DT algorithm (CART), a Random Forest and a state-of-the-art Deep Learning (DL) model. Competitive results were obtained by the GE methods, which present affordable training times and very fast predictive response times.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available