3.8 Proceedings Paper

REPAIR: Removing Representation Bias by Dataset Resampling

Publisher

IEEE
DOI: 10.1109/CVPR.2019.00980

Keywords

-

Funding

  1. NSF [IIS-1546305, IIS-1637941]
  2. NVIDIA

Ask authors/readers for more resources

Modern machine learning datasets can have biases for certain representations that are leveraged by algorithms to achieve high peiformance without learning to solve the underlying task. This problem is referred to as representation bias. The question of how to reduce the representation biases of a dataset is investigated and a new dataset REPresentAtion bIas Removal (REPAIR) procedure is proposed. This formulates bias minimization as an optimization problem, seeking a weight distribution that penalizes examples easy for a classifier built on a given feature representation. Bias reduction is then equated to maximizing the ratio between the classification loss on the reweighted dataset and the uncertainty of the ground-truth class labels. This is a minimax problem that REPAIR solves by alternatingly updating classifier parameters and dataset re sampling weights, using stochastic gradient descent. An experimental set-up is also introduced to measure the bias of any dataset for a given representation, and the impact of this bias on the peiformance of recognition models. Experiments with synthetic and action recognition data show that dataset REPAIR can significantly reduce representation bias, and lead to improved generalization of models trained on REPAIRed datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available