Article

Causal Modeling-Based Discrimination Discovery and Removal: Criteria, Bounds, and Algorithms

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume 31, Issue 11, Pages 2035-2050

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2018.2872988

Keywords

Discrimination discovery and removal; direct and indirect discrimination; causal modeling; path-specific effect

Funding

  1. NSF [1646654]
  2. Directorate for Computer & Information Science & Engineering
  3. Division of Information & Intelligent Systems [1646654] (funding source: National Science Foundation)

Abstract

Anti-discrimination is an increasingly important task in data science. In this paper, we investigate the problem of discovering both direct and indirect discrimination in historical data, and removing the discriminatory effects before the data are used for predictive analysis (e.g., building classifiers). The main drawback of existing methods is that they cannot distinguish the part of the influence that is actually caused by discrimination from all correlated influences. In our approach, we use a causal graph to capture the causal structure of the data. We then model direct and indirect discrimination as path-specific effects, which accurately identify the two types of discrimination as the causal effects transmitted along different paths in the graph. For situations where indirect discrimination cannot be measured exactly because some path-specific effects are unidentifiable, we develop an upper bound and a lower bound on the effect of indirect discrimination. Based on these theoretical results, we propose effective algorithms for discovering direct and indirect discrimination, as well as algorithms for precisely removing both types of discrimination while retaining good data utility. Experiments using a real dataset show the effectiveness of our approaches.
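
To make the idea of path-specific effects concrete, the following is a minimal sketch on a toy causal graph, not the paper's algorithm. It assumes a hypothetical three-variable graph with a binary protected attribute C, an intermediate attribute Z, and a decision E, with edges C -> E (the direct path) and C -> Z -> E (the indirect path), and no confounding. All probabilities, variable names, and the threshold `TAU` are invented for illustration.

```python
# Toy graph (assumed for illustration): C -> E and C -> Z -> E.
# C: protected attribute, Z: intermediate attribute, E: decision; all binary.

# Hypothetical conditional probabilities, not taken from any dataset.
P_Z1_GIVEN_C = {1: 0.8, 0: 0.3}                      # P(Z=1 | C=c)
P_E1_GIVEN_CZ = {(1, 1): 0.9, (1, 0): 0.6,           # P(E=1 | C=c, Z=z)
                 (0, 1): 0.7, (0, 0): 0.4}

TAU = 0.05  # hypothetical tolerance above which an effect is flagged


def p_z(z, c):
    """P(Z=z | C=c) for binary Z."""
    p1 = P_Z1_GIVEN_C[c]
    return p1 if z == 1 else 1.0 - p1


def path_specific_effect(c_direct, c_indirect, c_ref=0):
    """Change in P(E=1) relative to do(C=c_ref) when the edge C -> E
    receives c_direct and the path C -> Z -> E receives c_indirect."""
    mixed = sum(P_E1_GIVEN_CZ[(c_direct, z)] * p_z(z, c_indirect)
                for z in (0, 1))
    baseline = sum(P_E1_GIVEN_CZ[(c_ref, z)] * p_z(z, c_ref)
                   for z in (0, 1))
    return mixed - baseline


# Direct effect: only the edge C -> E sees the change from C=0 to C=1.
direct_effect = path_specific_effect(c_direct=1, c_indirect=0)
# Indirect effect: only the path C -> Z -> E sees the change.
indirect_effect = path_specific_effect(c_direct=0, c_indirect=1)

print(f"direct effect:   {direct_effect:.2f}, flagged: {abs(direct_effect) > TAU}")
print(f"indirect effect: {indirect_effect:.2f}, flagged: {abs(indirect_effect) > TAU}")
```

In this toy graph both effects are identifiable from the conditional probabilities alone. The bounds mentioned in the abstract address graphs where the indirect effect is not identifiable, which this sketch does not cover.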
