
Possible bug in multi_scale_deform_attn #2159

Closed
pixeli99 opened this issue Jul 29, 2022 · 5 comments · Fixed by #2158
@pixeli99

Hi,
I got an error when executing this line of code:

sampling_offsets = self.sampling_offsets(query).view(

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

I think there is a problem with the weight initialization. As you can see here, the bias of sampling_offsets may end up on the CPU:

self.sampling_offsets.bias.data = grid_init.view(-1)

On my machine, grid_init is created on the CPU:

thetas = torch.arange(
            self.num_heads,
            dtype=torch.float32) * (2.0 * math.pi / self.num_heads)
grid_init = torch.stack([thetas.cos(), thetas.sin()], -1)
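One possible fix, just a sketch on my side (not necessarily the change adopted in the linked PR), would be to build grid_init on the same device as the bias it overwrites, so the initialization also works after the model has been moved to the GPU. The helper name init_sampling_offsets_bias and its arguments below are made up for illustration; the body mirrors the existing init logic.

import math
import torch
import torch.nn as nn

# Sketch only: rebuild the sampling_offsets bias on the device the module
# already lives on, mirroring the original initialization logic.
def init_sampling_offsets_bias(sampling_offsets: nn.Linear, num_heads: int,
                               num_levels: int, num_points: int) -> None:
    device = sampling_offsets.bias.device  # cuda:0 if the model was moved already
    thetas = torch.arange(
        num_heads, dtype=torch.float32,
        device=device) * (2.0 * math.pi / num_heads)
    grid_init = torch.stack([thetas.cos(), thetas.sin()], -1)
    grid_init = (grid_init / grid_init.abs().max(-1, keepdim=True)[0]).view(
        num_heads, 1, 1, 2).repeat(1, num_levels, num_points, 1)
    for i in range(num_points):
        grid_init[:, :, i, :] *= i + 1
    sampling_offsets.bias.data = grid_init.view(-1)  # stays on `device`

The same effect could presumably be achieved by passing device= when thetas is created inside init_weights.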
@zhouzaida
Collaborator

Calling model.init_weights() after model.to('cuda:0') causes the error you mentioned above. Would you be willing to create a PR to resolve it?
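For context, a minimal way to reproduce the ordering problem (sketch only; it assumes a CUDA-enabled machine, an mmcv version before the linked fix, and illustrative constructor arguments):

import torch
from mmcv.ops import MultiScaleDeformableAttention

# Reproduction sketch: moving the module to the GPU *before* initializing
# the weights leaves the sampling_offsets bias on the CPU.
attn = MultiScaleDeformableAttention(embed_dims=256, num_heads=8)
attn = attn.to('cuda:0')   # all parameters move to cuda:0
attn.init_weights()        # grid_init is built on cpu and rebinds the bias
print(attn.sampling_offsets.weight.device)  # cuda:0
print(attn.sampling_offsets.bias.device)    # cpu -> forward() later raises the
                                            # "two devices" RuntimeError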

@pixeli99
Author

OK, but where is model.to('cuda:0') called?

I'm not sure where to fix it.

@zhouzaida
Collaborator

Oh, and there's already a PR (#2158) to fix this bug.

zhouzaida linked a pull request on Jul 30, 2022 that will close this issue
@pixeli99
Author

OK, thank you for your reply.😊

@zhouzaida
Collaborator


Done
