Assuming that you are solving for per-datum perturbations, and not a broadcast (uniform) perturbation, the loss aggregation performed prior to backprop should be `sum`, not `mean`. With `mean`, the gradient of each perturbation in the batch is scaled by the inverse batch size, whereas each perturbation's gradient should be independent of batch size. Obviously, this does not affect methods where the gradient is normalized.
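To illustrate the scaling (a minimal PyTorch sketch, not from the repo; the toy linear model and variable names are mine), with `mean` aggregation each datum's gradient comes out scaled by 1/N relative to `sum`:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(10, 1)                     # toy model standing in for the attacked network
x = torch.randn(4, 10, requires_grad=True)   # batch of N=4 per-datum perturbable inputs
y = torch.randn(4, 1)

per_example_loss = (model(x) - y).pow(2).squeeze(1)   # shape (N,)

# 'sum' aggregation: each datum's gradient is independent of the batch size
grad_sum, = torch.autograd.grad(per_example_loss.sum(), x, retain_graph=True)

# 'mean' aggregation: each datum's gradient is scaled by 1/N
grad_mean, = torch.autograd.grad(per_example_loss.mean(), x)

N = x.shape[0]
print(torch.allclose(grad_mean * N, grad_sum))  # True: mean-gradients are sum-gradients / N
```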
`mean = sum / N`, and thus ∂(mean)/∂(input) = (1/N) · ∂(sum)/∂(input).
Since PGD uses the sign of the gradient, sign(∂(mean)/∂(input)) = sign((1/N) · ∂(sum)/∂(input)) = sign(∂(sum)/∂(input)), so `mean` leads to the same result as `sum`.
Right, as I stated: "obviously, this does not affect methods where the gradient is normalized." The point is that this happens not to affect methods like FGSM because of the signed gradient, but other methods would exhibit incorrect behavior.
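For example (again a hedged, self-contained sketch with a toy model, not code from the repo): a signed step is identical under either aggregation, but an unnormalized gradient step shrinks by a factor of N under `mean`:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(10, 1)
x = torch.randn(4, 10, requires_grad=True)
y = torch.randn(4, 1)
N, step = x.shape[0], 0.01

per_example_loss = (model(x) - y).pow(2).squeeze(1)
grad_sum,  = torch.autograd.grad(per_example_loss.sum(),  x, retain_graph=True)
grad_mean, = torch.autograd.grad(per_example_loss.mean(), x)

# Sign-based updates (FGSM / L-inf PGD style) are unaffected by the 1/N factor
print(torch.equal(grad_sum.sign(), grad_mean.sign()))            # True

# An unnormalized gradient step is not: the 'mean' step is N times smaller
print(torch.allclose(step * grad_sum, N * (step * grad_mean)))   # True
```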
robustness/robustness/attacker.py, line 195 (commit a954124)