holisticai.security.metrics.attribute_attack_score#
- holisticai.security.metrics.attribute_attack_score(x_train: DataFrame, x_test: DataFrame, y_train: Series, y_test: Series, attribute_attack: str, attack_train_ratio: float = 0.5, **kargs) float[source]#
Calculate the accuracy score for black box attribute attack. It is done as follows: - The attack attribute is removed from the training data. - The label is added as an input feature, and a machine learning model is trained. - The model is used to predict the removed attribute, and the prediction is compared with the actual value.
Parameters
- x_train: pd.DataFrame
The training features.
- x_test: pd.DataFrame
The testing features.
- y_train: pd.Series
The training labels.
- y_test: pd.Series
The testing labels.
- attribute_attack: str
The attribute column in the x_train dataframe to attack.
- attack_train_ratio: float
The ratio of the attack data to the training data.
kargs: aditional attributes are passed to AttributeAttackScore class
Returns
float: The accuracy score for black box attribute attack.