☆ 4.7 Article

Predict-then-optimize or predict-and-optimize? An empirical evaluation of cost-sensitive learning strategies

INFORMATION SCIENCES (2022)

Journal

INFORMATION SCIENCES

Volume 594, Issue -, Pages 400-415

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2022.02.021

Keywords

Cost-sensitive learning; Instance-dependent costs; Classification; Supervised learning

Funding

BNP Paribas Fortis Chair in Fraud Analytics
FWO [G015020N]
Research Foundation - Flanders (FWO)
Flemish Government - department EWI

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Predictive models are increasingly used to optimize decision-making and minimize costs. This work compared the predict-then-optimize approach with the predict-and-optimize approach in cost-sensitive classification. The key finding was that the decision-making strategy was generally more effective than training with a task-specific loss or their combination.

Predictive models are increasingly being used to optimize decision-making and minimize costs. A conventional approach is predict-then-optimize: first, a predictive model is built; then, this model is used to optimize decision-making. A drawback of this approach, however, is that it only incorporates costs in the second stage. Conversely, the predict-and-optimize approach proposes learning a predictive model by directly minimizing the cost of the downstream decision-making task. This is achieved by using a task-specific loss function incorporating the costs of different outcomes in the first stage, with the eventual aim of obtaining more cost-effective decisions in the second stage. This work compares both approaches in the context of cost-sensitive classification. Conceptually, we use the two-stage framework to categorize existing cost-sensitive learning methodologies by differentiating between methodologies for cost-sensitive model training and decision-making. Empirically, we compare and evaluate both approaches using different cost-sensitive training and decision-making methodologies, as well as both class-dependent and instance-dependent cost-sensitive methods. This is achieved using real-world data from a range of application areas and a combination of cost-sensitive and cost-insensitive performance measures. The key finding is that the decision-making strategy is generally found to be more effective than training with a task-specific loss or their combination. (C) 2022 Elsevier Inc. All rights reserved.

Predict-then-optimize or predict-and-optimize? An empirical evaluation of cost-sensitive learning strategies

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Predict-then-optimize or predict-and-optimize? An empirical evaluation of cost-sensitive learning strategies

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper