4.7 Article

Adversarial learning for counterfactual fairness

Journal

MACHINE LEARNING
Volume 112, Issue 3, Pages 741-763

Publisher

SPRINGER
DOI: 10.1007/s10994-022-06206-8

Keywords

Counterfactual fairness; Adversarial neural network; Causal inference

Ask authors/readers for more resources

Fairness has been an important topic in machine learning research community, and counterfactual fairness focuses on achieving fairness at the individual level. Existing approaches rely on variational auto-encoding and maximum mean discrepancy penalization, but have limitations. This work proposes an adversarial neural learning approach to improve counterfactual fairness.
In recent years, fairness has become an important topic in the machine learning research community. In particular, counterfactual fairness aims at building prediction models which ensure fairness at the most individual level. Rather than globally considering equity over the entire population, the idea is to imagine what any individual would look like with a variation of a given attribute of interest, such as a different gender or race for instance. Existing approaches rely on Variational Auto-encoding of individuals, using Maximum Mean Discrepancy (MMD) penalization to limit the statistical dependence of inferred representations with their corresponding sensitive attributes. This enables the simulation of counterfactual samples used for training the target fair model, the goal being to produce similar outcomes for every alternate version of any individual. In this work, we propose to rely on an adversarial neural learning approach, that enables more powerful inference than with MMD penalties, and is particularly better fitted for the continuous setting, where values of sensitive attributes cannot be exhaustively enumerated. Experiments show significant improvements in term of counterfactual fairness for both the discrete and the continuous settings.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available