Disparate Impact Remover#
Note
Learning tasks: Binary classification, Multiclassification, Regression, Clustering, Recommender systems.
Introduction#
The Disparate Impact Remover method addresses the issue of unintentional bias in algorithmic decision-making processes. Disparate impact occurs when a selection process yields significantly different outcomes for different groups, even if the process appears neutral. This method is particularly relevant in contexts where algorithms are used for critical decisions, such as hiring, lending, and sentencing, and aims to ensure fairness by modifying the data used by these algorithms.
Description#
Problem definition
The problem of disparate impact is defined using the “80% rule” by the US Equal Employment Opportunity Commission (EEOC). Given a dataset \(D = (X, Y, C)\), where \(X\) is a protected attribute (e.g., race, sex, religion), \(Y\) represents the remaining attributes, and \(C\) is the binary class to be predicted (e.g., “will hire”), the dataset is said to have disparate impact if:
\[\frac{Pr(C = \text{YES} | X = 0)}{Pr(C = \text{YES} | X = 1)} \leq \tau = 0.8\]Here, \(Pr(C = c | X = x)\) denotes the conditional probability that the class outcome is \(c\) given the protected attribute \(x\).
Main features
The Disparate Impact Remover method focuses on two main problems:
Disparate Impact Certification: Ensuring that any classification algorithm predicting some \(C'\) from \(Y\) does not exhibit disparate impact.
Disparate Impact Removal: Modifying the dataset \(D\) to produce a new dataset \(\bar{D} = (X, \bar{Y}, C)\) that can be certified as not having disparate impact, while preserving the ability to classify as much as possible.
Step-by-step description of the approach
Linking Disparate Impact to Classification Accuracy: The method links disparate impact to the balanced error rate (BER), showing that any decision exhibiting disparate impact can be converted into one where the protected attribute can be predicted with low BER.
Certification Procedure: A procedure is proposed to certify the impossibility of disparate impact on a dataset. This involves using a regression algorithm that minimizes BER, connecting BER to disparate impact in various settings (point and interval estimates, and distributions).
Data Transformation: The input dataset is transformed to make the predictability of the protected attribute impossible. This transformation aims to preserve much of the signal in the unprotected attributes and maintain closeness to the original data distribution.
Basic Usage#
You can find an example of using the DisparateImpactRemover method in the following demo.
Read more about the class attributes and methods in the API reference: DisparateImpactRemover.
References#
Feldman, Michael, et al. “Certifying and removing disparate impact.” proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. 2015.