4.4 Article

An Improved Genetic-XGBoost Classifier for Customer Consumption Behavior Prediction

Journal

COMPUTER JOURNAL
Volume -, Issue -, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/comjnl/bxad041

Keywords

Lasso model; feature selection; genetic mechanism; XGBoost algorithm; consumption behavior prediction

Ask authors/readers for more resources

In this study, a new classifier for predicting customer consumption behavior is proposed. The classifier utilizes a feature selection method based on Lasso and PCA to efficiently select relevant features and eliminate correlations between variables. An improved genetic-XGBoost algorithm is also used to optimize the prediction accuracy by adjusting XGBoost parameters and preventing the model from falling into local extremum. Experimental results demonstrate the superiority of the proposed methods over existing ones, providing a decision-making basis for enterprises to formulate better marketing strategies.
In an increasingly competitive market, predicting the customer's consumption behavior has a vital role in customer relationship management. In this study, a new classifier for customer consumption behavior prediction is proposed. The proposed methods are as follows: (i) A feature selection method based on least absolute shrinkage and selection operator (Lasso) and Principal Component Analysis (PCA), to achieve efficient feature selection and eliminate correlations between variables. (ii) An improved genetic-eXtreme Gradient Boosting (XGBoost) for customer consumption behavior prediction, to improve the accuracy of prediction. Furthermore, the global search ability and flexibility of the genetic mechanism are used to optimize the XGBoost parameters, which avoids inaccurate parameter settings by manual experience. The adaptive crossover and mutation probabilities are designed to prevent the population from falling into the local extremum. Moreover, the grape-customer consumption behavior dataset is employed to compare the six Lasso-based models from the original, normalized and standardized data sources with the Isometric Mapping, Locally Linear Embedding, Multidimensional Scaling, PCA and Kernel Principal Component Analysis methods. The improved genetic-XGBoost is compared with several well-known parameter optimization algorithms and state-of-the-art classification approaches. Furthermore, experiments are conducted on the University of California Irvine datasets to verify the improved genetic-XGBoost algorithm. All results show that the proposed methods outperform the existing ones. The prediction results provide the decision-making basis for enterprises to formulate better marketing strategies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available