You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I try to print the value of Resampler, and found that the output values are strange (too large: -12296, 1219).
And I tried to train IP-Adapter-Plus-SDXL in fp16, which was easy to numerical overflow and caused NaN loss, have you adopted some tricks for stability training?
Hello, I try to print the value of Resampler, and found that the output values are strange (too large: -12296, 1219).
And I tried to train IP-Adapter-Plus-SDXL in fp16, which was easy to numerical overflow and caused NaN loss, have you adopted some tricks for stability training?
image (attn): tensor(-55.5000, device='cuda:0', dtype=torch.float16) tensor(41.8750, device='cuda:0', dtype=torch.float16) tensor(-0.0103, device='cuda:0', dtype=torch.float16)
image (attn): tensor(-82.1875, device='cuda:0', dtype=torch.float16) tensor(64.3750, device='cuda:0', dtype=torch.float16) tensor(-0.0302, device='cuda:0', dtype=torch.float16)
image (attn): tensor(-271.7500, device='cuda:0', dtype=torch.float16) tensor(116.8750, device='cuda:0', dtype=torch.float16) tensor(-0.0097, device='cuda:0', dtype=torch.float16)
image (attn): tensor(-252.5000, device='cuda:0', dtype=torch.float16) tensor(184.6250, device='cuda:0', dtype=torch.float16) tensor(0.0262, device='cuda:0', dtype=torch.float16)
image (bfr proj_out): tensor(-252.5000, device='cuda:0', dtype=torch.float16) tensor(184.6250, device='cuda:0', dtype=torch.float16) tensor(0.0262, device='cuda:0', dtype=torch.float16)
image (bfr norm): tensor(-12296., device='cuda:0', dtype=torch.float16) tensor(1219., device='cuda:0', dtype=torch.float16) tensor(0.5435, device='cuda:0', dtype=torch.float16)
The text was updated successfully, but these errors were encountered: