
Input wise masks for mask gradients #4

Merged Jul 19, 2024 (10 commits)

Conversation

oliveradk
Contributor

Adds support for computing input-wise mask gradients, which is useful e.g. for anomaly detection using edge attribution scores.

@oliveradk oliveradk marked this pull request as draft July 12, 2024 21:15
Owner

@UFO-101 UFO-101 left a comment


Thanks for the PR! This is a nice idea, although I think executing it in a clean way is somewhat challenging.

I'm nervous about a couple of things here:

  • It's not ideal to create a new patch_mask_batch parameter; this should be integrated into the existing mask parameter. As written, changing the mask for individual edges would no longer work, because the main mask parameter would no longer be used.
  • However, some existing functions will break if you simply overwrite the mask parameter with the new batch dimension (because edge indices will no longer work).
  • The patch_mode context manager shouldn't permanently alter the state of the model.

I would propose making this change in two steps:

  1. Add a new dummy batch dimension (of size 1) onto the mask parameters that is always present. Update the edge index functions to work as before with this. The core logic in PatchWrapperImpl can stay mostly the same; we will just broadcast the batch dimension by default (and update sample_hard_concrete to not add a new dimension).
  2. Add a new context manager in graph_utils, set_mask_batch_size, that temporarily adjusts the size of the mask batch dimension, allowing you to get the gradients for each batch element separately.
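The second step could be sketched roughly as follows. This is a hypothetical illustration, not the actual auto_circuit implementation: `TinyPatchableModel` and its `patch_mask` attribute are stand-ins, and only the name `set_mask_batch_size` comes from the proposal above.

```python
from contextlib import contextmanager

import torch


class TinyPatchableModel:
    """Stand-in for a patchable model holding a single edge mask."""

    def __init__(self, n_edges: int):
        # Shape [n_edges]: one mask value shared across the whole batch.
        self.patch_mask = torch.nn.Parameter(torch.zeros(n_edges))


@contextmanager
def set_mask_batch_size(model: TinyPatchableModel, batch_size: int):
    """Temporarily give the mask a leading batch dimension so that
    .grad holds a separate gradient for each batch element."""
    original = model.patch_mask
    try:
        # repeat (not expand) so each batch row is an independent value
        model.patch_mask = torch.nn.Parameter(
            original.detach().clone().unsqueeze(0).repeat(batch_size, 1)
        )
        yield model.patch_mask
    finally:
        model.patch_mask = original  # restore: no permanent state change


model = TinyPatchableModel(n_edges=3)
with set_mask_batch_size(model, batch_size=4) as batched_mask:
    assert batched_mask.shape == (4, 3)
    loss = (batched_mask * torch.randn(4, 3)).sum()
    loss.backward()
    per_input_grads = batched_mask.grad  # shape [4, 3]: one row per input
assert model.patch_mask.shape == (3,)  # original mask restored on exit
```

The key design point is that the temporary parameter is a `repeat` of the original rather than a broadcast view, so autograd accumulates a distinct gradient per batch row, and the `finally` block guarantees the model is unchanged after the context exits.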

Review threads (outdated, resolved): auto_circuit/data.py, auto_circuit/utils/graph_utils.py, auto_circuit/utils/patchable_model.py
@oliveradk oliveradk marked this pull request as ready for review July 15, 2024 18:26
@oliveradk
Contributor Author

Thanks for the feedback! I mostly implemented your suggestions, but instead of adding a dimension to patch_mask by default, I only add the batch dimension when set_mask_batch_size is called. This minimizes the chance of introducing downstream bugs, and I don't think any of the edge-indexing functionality is critical if the main use case is collecting attribution scores over batches (please let me know if I'm missing something crucial there).

Owner

@UFO-101 UFO-101 left a comment


Thanks for the changes! I think this is a big improvement.

I'm still not entirely happy with this implementation, as it breaks a number of other features. But I recognize that a more general solution is a bigger project that isn't relevant to your use case, so I don't want to block this much longer.

My main requests are to add a warning about these problems to the docstrings, and to improve the test a little.

if not mask_expanded:
    mask = mask.repeat(batch_size, *([1] * mask.ndim))
else:
    assert mask.size(0) == batch_size
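In isolation, the expand-or-validate logic above behaves roughly like this (a hypothetical `broadcast_mask` helper, with shapes chosen purely for illustration):

```python
import torch


def broadcast_mask(
    mask: torch.Tensor, batch_size: int, mask_expanded: bool
) -> torch.Tensor:
    """Ensure the mask has a leading batch dimension of size batch_size."""
    if not mask_expanded:
        # Prepend a batch dim by repetition: [*dims] -> [batch_size, *dims]
        mask = mask.repeat(batch_size, *([1] * mask.ndim))
    else:
        # Mask already carries a batch dim; just sanity-check its size
        assert mask.size(0) == batch_size
    return mask


unexpanded = torch.zeros(2, 3)
out = broadcast_mask(unexpanded, batch_size=4, mask_expanded=False)
assert out.shape == (4, 2, 3)  # batch dim prepended

already_expanded = torch.zeros(4, 2, 3)
out = broadcast_mask(already_expanded, batch_size=4, mask_expanded=True)
assert out.shape == (4, 2, 3)  # passed through unchanged
```

Note that `tensor.repeat(n, 1, ..., 1)` with one more argument than the tensor has dimensions implicitly treats the tensor as having a leading size-1 dimension, which is what prepends the batch axis here.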
Owner


Instead of adding a new parameter, it would be better to just check if it's already the correct shape and adjust accordingly.

Contributor Author


Some of the masks are 1-d though, so it's unclear how to distinguish an expanded 1-d mask from a 2-d mask whose first dimension happens to equal the batch size.
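A small illustration of the ambiguity (shapes chosen arbitrarily): once a 1-d mask is expanded over the batch, its shape is identical to that of a natively 2-d mask whose first dimension coincidentally equals the batch size, so a shape check alone cannot tell them apart.

```python
import torch

batch_size = 3
n_edges = 5

# A 1-d mask of n_edges values, expanded over the batch: [batch_size, n_edges]
expanded_1d = torch.zeros(n_edges).repeat(batch_size, 1)

# A natively 2-d mask whose first dimension happens to equal batch_size
native_2d = torch.zeros(batch_size, n_edges)

# Shape alone cannot tell these apart, hence the explicit flag
assert expanded_1d.shape == native_2d.shape == (batch_size, n_edges)
```

This is why the snippet above keeps an explicit `mask_expanded` flag instead of inferring expansion from the tensor shape.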

Owner


Ah yes, good point. Seems fine how it is then.

Review threads (resolved): auto_circuit/utils/patch_wrapper.py, auto_circuit/utils/graph_utils.py; seven outdated, resolved threads on tests/utils/test_instance_grads.py
Owner

@UFO-101 UFO-101 left a comment


Looks great, thanks for this!

@UFO-101
Owner

UFO-101 commented Jul 18, 2024

I will just do a couple of checks locally and then merge in the next hour or so.

@UFO-101
Owner

UFO-101 commented Jul 19, 2024

@oliveradk Could you please fix merge conflicts and final comment? Then I will merge.

@oliveradk
Contributor Author

Fixed merge conflicts and addressed the comment (I had been running the test incorrectly and had to make some more tweaks).

@UFO-101 UFO-101 merged commit 3ad51b5 into UFO-101:main Jul 19, 2024