SHAPr: SHapley vAlue-based Privacy Risk#
Definition#
SHAPr is a membership privacy metric based on Shapley values, originally intended to measure the contribution of a training data record on model utility. SHAPr approximates a leave-one-out (LOO) technique to estimate the membership privacy risk of individual training data records in machine learning models.
Mathematical Formulation#
The Shapley value \((\phi_i)\) of a data record \((z_i = (x_i, y_i))\) is defined as:
where (S) is a randomly chosen subset of \(D_{tr} \setminus \{z_i\}\) and (U(S)) is the utility of the model trained on subset (S).
Interpretation#
Positive SHAPr Score (phi > 0): Indicates that the data record contributed positively to the model’s utility, suggesting a higher likelihood of memorization and, therefore, a higher membership privacy risk.
Negative SHAPr Score (phi < 0): Indicates that the data record was either harmful or not significantly contributing to the model’s utility, suggesting lower susceptibility to membership inference attacks.
Zero SHAPr Score (phi = 0): Implies that the presence or absence of the data record does not affect the model’s utility, indicating no membership privacy risk.
Reference#
For more details, see the paper SHAPr: An Efficient and Versatile Membership Privacy Risk Metric.