fix models for PyTorch v0.4 (remove .data and add _ for the initializations … #481

Merged: 9 commits into pytorch:master on Apr 30, 2018

Conversation

moskomule (Contributor)

Hi, this is about #479.
Some models emit warnings at initialization because they call nn.init.**(tensor) instead of the in-place nn.init.**_(tensor), so I moved them to nn.init.**_(). I also removed the Variable.data accesses.
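
(For context, a minimal illustration of the rename in v0.4, with an arbitrary layer; the layer and initializer here are only examples:)

import torch.nn as nn

fc = nn.Linear(4, 4)
nn.init.xavier_uniform(fc.weight)   # deprecated in v0.4, emits a deprecation warning
nn.init.xavier_uniform_(fc.weight)  # in-place underscore variant, no warning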

 elif isinstance(m, nn.BatchNorm2d):
-    m.weight.data.fill_(1)
-    m.bias.data.zero_()
+    m.weight.fill_(1)

soumith (Member) left a comment

The .data access is the equivalent of saying "don't record history while I do these operations."
I think it needs to be preserved. We don't want an autograd graph defined over (or backpropagated through) the initialization operations.
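
(For reference, a minimal sketch of the two ways to keep initialization out of the autograd graph in v0.4; the layer and constants are illustrative:)

import torch
import torch.nn as nn

conv = nn.Conv2d(3, 8, kernel_size=3)

# going through .data bypasses history recording
conv.weight.data.normal_(0, 0.01)

# torch.no_grad() does the same thing explicitly
with torch.no_grad():
    conv.weight.normal_(0, 0.01)

# without either of the above, a bare in-place op on a leaf tensor that
# requires grad raises a RuntimeError in v0.4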

soumith (Member) commented Apr 27, 2018

The nn.init.**_() changes are great. Please revert the .data changes.

fmassa (Member) commented Apr 27, 2018

I think another option (instead of using .data) would be to use with torch.no_grad():. Would that be more in line with best practices for v0.4?

soumith (Member) commented Apr 27, 2018

No, let's not have to use with torch.no_grad at the cost of verbosity / readability in these simple cases.

fmassa (Member) commented Apr 27, 2018

I have another proposal: replace the hand-coded initialization with the torch.nn.init functions. They use torch.no_grad() internally, so we can also remove the .data calls from the code. What do you think?

moskomule (Contributor, Author)

So, for example, m.weight.fill_(1) would become nn.init.constant_(m.weight, 1)?

fmassa (Member) commented Apr 28, 2018

@moskomule yes, precisely
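
(A minimal sketch of what that looks like for the BatchNorm branch of the weight-initialization loop, assuming the same constants as the existing code; nn.init.constant_ runs under torch.no_grad() internally, so neither .data nor an explicit no_grad block is needed:)

for m in self.modules():
    if isinstance(m, nn.BatchNorm2d):
        nn.init.constant_(m.weight, 1)
        nn.init.constant_(m.bias, 0)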

fmassa (Member) left a comment

Thanks for the modifications! I still have a few comments that could simplify parts of the code.
Also, could you revert the unnecessary line changes / added spaces?

@@ -48,15 +48,15 @@ def _initialize_weights(self):
         for m in self.modules():
             if isinstance(m, nn.Conv2d):
                 n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
-                m.weight.data.normal_(0, math.sqrt(2. / n))
+                nn.init.normal_(m.weight, 0, math.sqrt(2. / n))

@@ -4,10 +4,8 @@
import torch.nn.init as init
import torch.utils.model_zoo as model_zoo


@@ -113,18 +111,18 @@ def __init__(self, block, layers, num_classes=1000):
         for m in self.modules():
             if isinstance(m, nn.Conv2d):
                 n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
-                m.weight.data.normal_(0, math.sqrt(2. / n))
+                nn.init.normal_(m.weight, 0, math.sqrt(2. / n))

            nn.Conv2d(self.inplanes, planes * block.expansion,
                      kernel_size=1, stride=stride, bias=False),
            nn.BatchNorm2d(planes * block.expansion),
            nn.Conv2d(self.inplanes, planes * block.expansion,

-                m.weight.data.copy_(values)
+                values = torch.Tensor(X.rvs(m.weight.numel()))
+                values = values.view(m.weight.size())
+                m.weight.copy_(values)

fmassa mentioned this pull request on Apr 29, 2018
moskomule (Contributor, Author)

Thanks for reviewing. I've fixed the style issues. Regarding the simplifications, could you check my replies to the comments?

moskomule (Contributor, Author)

For inception.py, it would be simpler if we had a truncated_normal initializer in nn.init.
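
(For reference, a rough sketch of what a scipy-based truncated-normal helper could look like, mirroring the hand-coded logic in inception.py; the function name, default stddev, and truncation bounds are only illustrative:)

import torch
import scipy.stats as stats

def truncated_normal_(tensor, stddev=0.1):
    # sample from a normal truncated to [-2, 2] standard deviations,
    # then copy into the parameter without recording autograd history
    X = stats.truncnorm(-2, 2, scale=stddev)
    values = torch.Tensor(X.rvs(tensor.numel())).view(tensor.size())
    with torch.no_grad():
        tensor.copy_(values)
    return tensor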

fmassa (Member) left a comment

Almost good. A few more comments

@@ -112,11 +112,10 @@ def __init__(self, block, layers, num_classes=1000):

         for m in self.modules():
             if isinstance(m, nn.Conv2d):
-                n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
-                m.weight.data.normal_(0, math.sqrt(2. / n))
+                nn.init.kaiming_normal_(m.weight, mode="fan_out")
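
(For a ReLU network this draws from the same distribution as the hand-coded version: kaiming_normal_ with mode="fan_out" samples from N(0, 2 / fan_out), and for a Conv2d fan_out equals kernel_size[0] * kernel_size[1] * out_channels, i.e. exactly the n computed by hand. A quick illustrative check, with an arbitrary layer:)

import math
import torch.nn as nn

conv = nn.Conv2d(16, 32, kernel_size=3)
n = conv.kernel_size[0] * conv.kernel_size[1] * conv.out_channels  # fan_out

nn.init.kaiming_normal_(conv.weight, mode="fan_out", nonlinearity="relu")
# the empirical std should be close to the hand-coded target
print(conv.weight.std().item(), math.sqrt(2. / n))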

@@ -130,11 +130,11 @@ def __init__(self, num_input_features, growth_rate, bn_size, drop_rate):
         self.add_module('norm1', nn.BatchNorm2d(num_input_features)),
         self.add_module('relu1', nn.ReLU(inplace=True)),
         self.add_module('conv1', nn.Conv2d(num_input_features, bn_size *
-                growth_rate, kernel_size=1, stride=1, bias=False)),
+                        growth_rate, kernel_size=1, stride=1, bias=False)),

@@ -47,16 +47,15 @@ def forward(self, x):
     def _initialize_weights(self):
         for m in self.modules():
             if isinstance(m, nn.Conv2d):
-                n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
-                m.weight.data.normal_(0, math.sqrt(2. / n))
+                nn.init.kaiming_normal_(m.weight, mode="fan_out")

fmassa merged commit f87a896 into pytorch:master on Apr 30, 2018
fmassa (Member) commented Apr 30, 2018

Looks great, thanks @moskomule! The build failures are unrelated.

varunagrawal pushed a commit to varunagrawal/vision that referenced this pull request on Jul 23, 2018:
fix models for PyTorch v0.4 (remove .data and add _ for the initializations … (pytorch#481)

* fix for PyTorch v0.4 (remove .data and add _ for the initializations in nn.init)

* fix m.**.**() style to nn.init.**(**) style

* remove .idea

* fix lines and indents

* fix lines and indents

* change to use `kaming_normal_`

* add `.data` for safety

* add nonlinearity='relu' for sure

* fix indents