[compute_numeric_gradient] Use torch.flatten instead of view(-1). #5

mhaz · 2022-01-21T10:17:22Z

I ran into technical issues when doing the convolutional network exercise from assignment 3 for the 2019 iteration of the class.
The issue was the same as #4 and stemmed from using .view(-1) to flatten the f(x+h) and f(x-h) tensors in the numerical gradient evaluation. Using flatten instead seems to fix the problem.

Unfortunately, I ran into another runtime error the "Batchnorm for deep convolutional networks" subsection. It seems to stem from the implementation of FastConv, but this might be more than I can chew right now.

[compute_numeric_gradient] Use torch.flatten instead of view(-1).

c21e03a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[compute_numeric_gradient] Use torch.flatten instead of view(-1). #5

[compute_numeric_gradient] Use torch.flatten instead of view(-1). #5

mhaz commented Jan 21, 2022

[compute_numeric_gradient] Use torch.flatten instead of view(-1). #5

Are you sure you want to change the base?

[compute_numeric_gradient] Use torch.flatten instead of view(-1). #5

Conversation

mhaz commented Jan 21, 2022