holisticai.security.metrics.k_anonymity#

holisticai.security.metrics.k_anonymity(df, qi)[source]#

Description

Computes k-Anonymity metric. K-anonymity is a property of a dataset that indicates the re-identifiability of its records. A dataset is k-anonymous if quasi-identifiers for each person in the dataset are identical to at least k - 1 other people also in the dataset.

Parameters

dfpandas Dataframe

input dataset

qilist or numpy array

quasi identifiers

Returns

pd.Series: Computed metric.