Fix device handling in tests #52

dtch1997 · 2023-12-23T23:10:50Z

Modifies ModelPatcher so that target activations are moved onto the appropriate device before the operator is applied.

closes #51

dtch1997 · 2023-12-23T23:17:48Z

repepo/utils/model_patcher.py

@@ -185,7 +185,8 @@ def _create_additive_hook(

    def hook_fn(_m: Any, _inputs: Any, outputs: Any) -> Any:
        original_tensor = untuple_tensor(outputs)
-        original_tensor[None] = operator(original_tensor, target_activation)
+        act = target_activation.to(original_tensor.device)


Moving the activation to the output's device here means callers no longer need to set the device of each activation themselves.

E.g.:

activations = {
    1: torch.randn(1, 512),  # can be on cpu
}
...
model_patcher.patch_activations(
    activations, layer_type=layer_type, operator="piecewise_addition"
)
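To illustrate why moving the activation inside the hook is sufficient, here is a minimal, self-contained sketch. It does not use the repo's ModelPatcher: `make_additive_hook` and the toy `nn.Linear` layer are hypothetical stand-ins, and plain addition stands in for the operator.

```python
from typing import Any

import torch
from torch import nn


def make_additive_hook(target_activation: torch.Tensor) -> Any:
    """Hypothetical stand-in for _create_additive_hook, using plain addition."""

    def hook_fn(_m: Any, _inputs: Any, outputs: torch.Tensor) -> torch.Tensor:
        # Move the activation to wherever the layer output lives.
        act = target_activation.to(outputs.device)
        # Returning a value from a forward hook replaces the module's output.
        return outputs + act

    return hook_fn


# Use a GPU when one is available; the sketch also runs on CPU only.
device = "cuda" if torch.cuda.is_available() else "cpu"
layer = nn.Linear(4, 4).to(device)
activation = torch.ones(1, 4)  # deliberately left on the CPU

x = torch.zeros(2, 4, device=device)
with torch.no_grad():
    baseline = layer(x)
    handle = layer.register_forward_hook(make_additive_hook(activation))
    patched = layer(x)
    handle.remove()
```

Because the hook reads the device from the layer output at call time, the same activation dictionary works whether the model sits on the CPU or a GPU.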

dtch1997 · 2023-12-23T23:21:43Z

As of 16426e4, all the tests pass for me locally (with GPU enabled):

$ python -m pytest tests/util/test_model_patcher.py 
...
12 passed, 9 warnings in 15.54s

chanind

makes sense, LGTM!

dtch1997 · 2024-01-10T12:31:25Z

@chanind after merging in your latest changes from main, one of the tests isn't working any more.

Looking at the error logs, it seems to be in this code chunk:

def _create_additive_hook(
    target_activation: torch.Tensor, operator: PatchOperator
) -> Any:
    """Create a hook function that adds the given target_activation to the model output"""

    def hook_fn(_m: Any, _inputs: Any, outputs: Any) -> Any:
        original_tensor = untuple_tensor(outputs)
        act = target_activation.to(original_tensor.device) # This line raises an error
        original_tensor[None] = operator(original_tensor, act)
        return outputs

    return hook_fn

Looking at the CI logs here, this line fails:

act = target_activation.to(original_tensor.device)

with this error:

AttributeError: 'numpy.ndarray' object has no attribute 'to'

I.e. target_activation seems to be a numpy array. Is this behaviour intended? The type hint indicates that target_activation should be a torch.Tensor.
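For reference, one possible guard against this failure is to coerce the activation to a tensor before moving it. This is only a sketch under assumptions: `as_tensor_on` is a hypothetical helper, not part of the repo, and not necessarily how the issue was resolved upstream.

```python
from typing import Union

import numpy as np
import torch


def as_tensor_on(
    value: Union[np.ndarray, torch.Tensor], like: torch.Tensor
) -> torch.Tensor:
    """Hypothetical helper: coerce value to a tensor matching like's device and dtype."""
    # torch.as_tensor accepts both NumPy arrays and existing tensors.
    return torch.as_tensor(value).to(device=like.device, dtype=like.dtype)


output = torch.zeros(2, 3)    # stands in for the layer output
activation = np.ones((1, 3))  # float64 NumPy array, as in the failing case
act = as_tensor_on(activation, output)
patched = output + act        # broadcasts over the batch dimension
```

Matching the dtype as well as the device avoids silently promoting a float32 output to float64 when the array comes from NumPy.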

chanind · 2024-01-10T13:35:39Z

Good catch! Fixed in #55

dtch1997 added 2 commits December 23, 2023 23:09

Fix device handling in tests

6855e1c

Fix pyright bugbears

16426e4

dtch1997 requested a review from chanind December 23, 2023 23:18

chanind approved these changes Dec 24, 2023

Merge branch 'main' into model_patcher_device

09c50ce

dtch1997 added 3 commits January 10, 2024 13:59

Merge branch 'main' into model_patcher_device

4dae86b

Set device automatically for RepE pipeline

578d93b

Specify device in test

9343ed5

dtch1997 merged commit f46940a into main Jan 10, 2024
2 checks passed

dtch1997 deleted the model_patcher_device branch January 31, 2024 11:07
