4.7 Article

Predict-then-optimize or predict-and-optimize? An empirical evaluation of cost-sensitive learning strategies

Journal

INFORMATION SCIENCES
Volume 594, Issue -, Pages 400-415

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.02.021

Keywords

Cost-sensitive learning; Instance-dependent costs; Classification; Supervised learning

Funding

  1. BNP Paribas Fortis Chair in Fraud Analytics
  2. FWO [G015020N]
  3. Research Foundation - Flanders (FWO)
  4. Flemish Government - department EWI

Ask authors/readers for more resources

Predictive models are increasingly used to optimize decision-making and minimize costs. This work compared the predict-then-optimize approach with the predict-and-optimize approach in cost-sensitive classification. The key finding was that the decision-making strategy was generally more effective than training with a task-specific loss or their combination.
Predictive models are increasingly being used to optimize decision-making and minimize costs. A conventional approach is predict-then-optimize: first, a predictive model is built; then, this model is used to optimize decision-making. A drawback of this approach, however, is that it only incorporates costs in the second stage. Conversely, the predict-and-optimize approach proposes learning a predictive model by directly minimizing the cost of the downstream decision-making task. This is achieved by using a task-specific loss function incorporating the costs of different outcomes in the first stage, with the eventual aim of obtaining more cost-effective decisions in the second stage. This work compares both approaches in the context of cost-sensitive classification. Conceptually, we use the two-stage framework to categorize existing cost-sensitive learning methodologies by differentiating between methodologies for cost-sensitive model training and decision-making. Empirically, we compare and evaluate both approaches using different cost-sensitive training and decision-making methodologies, as well as both class-dependent and instance-dependent cost-sensitive methods. This is achieved using real-world data from a range of application areas and a combination of cost-sensitive and cost-insensitive performance measures. The key finding is that the decision-making strategy is generally found to be more effective than training with a task-specific loss or their combination. (C) 2022 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available