-
Hi @nkrusch These are very interesting questions! Please allow me to transfer this issue to our Discussions tab to continue, and we can see from there if we can create new feature issues.
-
Hi. I find this discussion interesting (thanks). Does …
-
Can you please give me some tips or code on doing this for Zoo? Thank you.
On Fri, 16 Dec 2022, 05:46, Neea Rusch wrote:
I have continued to work on this problem since then. ZooAttack does not have mask support (see https://github.com/Trusted-AI/adversarial-robustness-toolbox/blob/987052c405e05d458276299aafc7d47bb584e738/art/attacks/evasion/zoo.py#L204-L212). Depending on your use case, it may be doable with some modification.
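On the Zoo question above: since ZooAttack exposes no mask parameter, one workaround is to run the attack unconstrained and then copy the immutable columns back from the clean inputs afterwards. A minimal sketch — the toy data, the logistic-regression model, the column indices, and the attack parameters are all illustrative assumptions, not part of the original question:

```python
# Sketch only: run ZooAttack unconstrained, then restore the immutable columns.
import numpy as np
from sklearn.linear_model import LogisticRegression
from art.estimators.classification import SklearnClassifier
from art.attacks.evasion import ZooAttack

rng = np.random.default_rng(0)
x = rng.random((50, 8)).astype(np.float32)   # 8 hypothetical features: A..H
y = (x[:, 5] > 0.5).astype(int)              # toy binary labels

clf = SklearnClassifier(model=LogisticRegression().fit(x, y),
                        clip_values=(0.0, 1.0))

# Tabular data: disable the image-specific resize/importance heuristics.
attack = ZooAttack(classifier=clf, max_iter=20, use_resize=False,
                   use_importance=False, nb_parallel=1, variable_h=0.2)
x_adv = attack.generate(x)

# Undo any perturbation of the attributes the adversary may not touch (A, B).
immutable = [0, 1]
x_adv[:, immutable] = x[:, immutable]
```

Note that restoring the immutable columns afterwards can break the attack's effect, so it is worth re-checking `clf.predict` on the repaired examples and keeping only those that still flip the prediction.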
-
Is your feature request related to a problem? Please describe.
Suppose I want to perform an evasion attack where only a subset of attributes can be mutated by the adversary; the remaining attributes cannot be modified. A related but separate question: how do I express a group of attributes produced by binarizing a categorical feature, where only one category can be selected at a time (and selecting multiple would render the data invalid)?
In this example, the values of (A, B) are immutable; (C, D, E) are the binarized values of one categorical attribute after preprocessing; and the rest (F, ...) can be mutated freely by the adversary:
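For concreteness, here is a hypothetical layout matching that description, together with the kind of 0/1 perturbation mask that would encode the immutability part of the constraint (the feature count and indices are assumptions made up for illustration):

```python
import numpy as np

n_features = 8                      # hypothetical width of one preprocessed row
immutable = [0, 1]                  # A, B: the adversary may not change these
onehot_group = [2, 3, 4]            # C, D, E: exactly one of them may be 1

# 1 = the attack may perturb this column, 0 = the column is frozen
mask = np.ones(n_features, dtype=np.float32)
mask[immutable] = 0.0
```

The mask alone captures only the immutability constraint; the "exactly one of (C, D, E)" constraint needs a separate repair or projection step.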
How can I set up the attack so that these constraints are guaranteed to be preserved in the generated adversarial instances?
This question is about ART in general; I am looking for an existing (or future) way to achieve this behavior.
Describe the solution you'd like
I would like to specify explicitly, as an attack parameter, the mutable and immutable attributes, and similar firm constraints about relationships between attributes (if there is an existing way to achieve this behavior, please advise).
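For the immutability half of this, some ART evasion attacks already accept a `mask` keyword in `generate()` — the gradient-based ones such as FastGradientMethod and ProjectedGradientDescent do in recent versions, while ZooAttack does not. A hedged sketch, reusing `clf` from the ZooAttack example and `mask` from the layout example above (a gradient-based attack only applies if the wrapped model exposes loss gradients):

```python
from art.attacks.evasion import ProjectedGradientDescent

pgd = ProjectedGradientDescent(estimator=clf, eps=0.1, eps_step=0.01, max_iter=50)
# Columns where mask == 0 are never perturbed; the one-hot constraint on
# (C, D, E) is not expressed by the mask and still needs post-processing.
x_adv_pgd = pgd.generate(x, mask=mask)
```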
Describe alternatives you've considered
It is currently unclear to me whether the specific attacks support this kind of constrained scenario in theory (I will need to review the papers).
Assuming it can be done, the technical alternatives are to: (A) run the attack first, then post-prune or repair the examples that are invalid (see the sketch at the end of this post), or (B) extend the toolkit to support this behavior. Simply removing the immutable attributes is not an option, because they are needed for training.
This question may seem moot in a black-box setting, where the attacker is not supposed to know the internals of the classifier. However, let's assume it is "common knowledge", extending beyond the classifier, that the data must adhere to some format, and that the attacker is aware of this. Then it is not unreasonable to assume the attacker wants to preserve these constraints.
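A sketch of alternative (A): repair constraint violations after the attack, then keep only the instances that remain adversarial. The indices follow the (A, B) / (C, D, E) example above, and `clf`, `x`, `x_adv` come from the ZooAttack sketch earlier in the thread; all of these are assumptions for illustration:

```python
import numpy as np

def repair(x_clean: np.ndarray, x_adv: np.ndarray) -> np.ndarray:
    """Restore immutable columns and project the one-hot group to a valid encoding."""
    out = x_adv.copy()
    out[:, [0, 1]] = x_clean[:, [0, 1]]            # immutable A, B
    group = out[:, [2, 3, 4]]                      # binarized categorical C, D, E
    onehot = np.zeros_like(group)
    onehot[np.arange(len(group)), group.argmax(axis=1)] = 1.0
    out[:, [2, 3, 4]] = onehot
    return out

x_fixed = repair(x, x_adv)

# "Post-prune": keep only instances whose prediction still flips after repair.
clean_pred = clf.predict(x).argmax(axis=1)
still_adv = clf.predict(x_fixed).argmax(axis=1) != clean_pred
x_valid = x_fixed[still_adv]
```

The obvious drawback of (A) is that the attack never "knows" about the constraints, so the repair step may discard many otherwise-successful examples; (B) avoids that, at the cost of modifying the toolkit.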