Binary Classification#
Equality of Outcome Metrics#
The idea of equality of outcome metrics in classification tasks, is to compare the rate of success in the privileged group with the rate of success in the unprivileged group. We define the success rate \(SR_{g}\) of a group \(g\) as the ratio of the number of positive outcomes to the total number of outcomes in that group:
where \(TP_{g}\) is the number of true positives in group \(g\) and \(FP_{g}\) is the number of false positives in group \(g\).
Note
The idea is that an unbiased system would present roughly similar success rates across groups.
We will refer to the success rate for the unprivileged group as \(SR_{b}\) and the success rate for the privileged group as \(SR_{a}\). We can then define the following metrics.
Disparate Impact (DI): measures the ratio of success rates. The ideal value is 1.
The acceptable range is [0.8, 1.2]. Values below 0.8 are unfair towards group_a. Values above 1.2 are unfair towards group_b.
Statistical Parity (SP): measures the difference between success rates.
A negative value means that the unprivileged group_b is unfavoured. Ideal value is 0. Negative values are unfair towards group b.
Cohen’s D (CD): measures the effect size of the difference between success rates.
Ideal value is 0. Positive values are unfair towards group b. Reference values: 0.2 is considered a small effect size, 0.5 is considered medium, 0.8 is considered large.
where \(poolStd\) is the pooled standard deviation of the success rates, defined as
where \(n_{a}\) and \(n_{b}\) are the number of samples in groups \(a\) and \(b\), respectively, and \(\sigma^{2}_{a}\) and \(\sigma^{2}_{b}\) are the variances of the success rates in groups \(a\) and \(b\), respectively.
2-SD Rule: measures the difference between success rates in terms of standard deviations.
The ideal value is 0. Positive values are unfair towards group b.
Four-Fifths Rule: measures the ratio of success rates.
The ideal value is 1. Values below 0.8 and above 1.2 are considered unfair towards group_b.
Equality of Opportunity Metrics#
The idea of equality of opportunity metrics, is to compare true positives and/or false positives rates across groups. We define the true positive rate \(TPR_{g}\) of a group \(g\) as the ratio of the number of true positives to the total number of actual positives in that group:
where \(FN_{g}\) is the number of false negatives in group \(g\).
Equality of Opportunity Difference: measures the difference between true positive rates.
Ideal value: 0 and Fair area: [-0.1, 0.1]
where \(TPR_{g}\) is the true positive rate of group \(g\).
False Positive Rate Difference: measures the difference between false positive rates.
The ideal value is 0. Positive values are unfair towards group b.
where \(FPR_{g}\) is the false positive rate of group \(g\).
Average Odds Difference measures the average of the difference between true positive rates and false positive rates.
Ideal value: 0 and Fair area: [-0.1, 0.1]
Accuracy Difference: measures the difference between accuracy rates.
The ideal value is 0. Positive values are unfair towards group b.
where \(ACC_{g}\) is the accuracy of group \(g\).