
Local Response Normalization [WIP] #312

Closed
wants to merge 3 commits

Conversation

@ayush1999 (Contributor)

Added a new LRNorm struct, which takes four hyperparameters and returns the normalized output.
Implemented by referring to the ImageNet paper (Krizhevsky et al., 2012), which can be found here.
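
For reference, the normalization that paper defines, with the four hyperparameters k, n, α, β and N the number of channels:

b^i_{x,y} = \frac{a^i_{x,y}}{\left( k + \alpha \sum_{j=\max(0,\, i - n/2)}^{\min(N-1,\, i + n/2)} \left( a^j_{x,y} \right)^2 \right)^{\beta}}

where a is the input activation map, b the output, and the sum runs over a window of n adjacent channels at the same spatial position.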

    return temp
end

children(LRN::LRNorm) =
Member

Any reason not to use treelike here?

@MikeInnes (Member)

Looks generally good, but is it possible to have a vectorised implementation so it's more GPU-friendly?

function (LRN::LRNorm)(x)
    w, h, C, N = size(x)
    temp = zeros(size(x))
    for z_ = 1:N
Member

Slightly cleaner to do for z_=1:N, x_=1:w, ...; end here.

@ayush1999 (Contributor, Author)

@MikeInnes I've made the required changes. I'm not sure we can vectorise it; it'd be better to get this merged for now and make changes if a more optimal approach is found later. (As of now we'd have to use these 5 loops, since we're visiting every element of the 4-D input and traversing the adjacent cells for each of them. A sketch of that formulation follows.)
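
For concreteness, a sketch of the five-loop formulation being discussed, reconstructed from the formula above rather than taken from the PR's actual diff:

# Naive scalar-loop LRN over a WHCN array; n, k, alpha, beta are the
# hyperparameters from the ImageNet paper.
function lrn_loops(x, n, k, alpha, beta)
    w, h, C, N = size(x)
    temp = zeros(eltype(x), size(x))
    for z_ in 1:N, x_ in 1:w, y_ in 1:h, c in 1:C
        lo, hi = max(1, c - n ÷ 2), min(C, c + n ÷ 2)
        s = zero(eltype(x))
        for j in lo:hi                       # fifth loop: the channel window
            s += x[x_, y_, j, z_]^2
        end
        temp[x_, y_, c, z_] = x[x_, y_, c, z_] / (k + alpha * s)^beta
    end
    return temp
end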

@MikeInnes (Member)

Ok. Some tests will be essential, but we also need an implementation of the backward pass – at least on CPU – so that it can be used for training. Ideally we'd also have a GPU implementation, but I suppose we can punt on that for now.

@ayush1999 (Contributor, Author)

@MikeInnes sure, I'll add a few tests. Also, since the LRN implementation uses base operators, AD should be able to handle the backprop itself, right? So there's no need to explicitly write the gradients for it.

@MikeInnes (Member)

No, that's what I meant above about vectorising it. If you want gradients and GPU support to work automatically you need to use whole-array operations like matmul, broadcasting and reduction. They can't support general loops and scalar indexing, so if you write it like that you also need a gradient implementation (as well as GPU versions).
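
A minimal sketch of that distinction, using a sum of squares as a stand-in; the Tracker-era gradient call is an assumption about the AD Flux used at the time:

using Flux

# Scalar version: mutates a temporary array element by element. The tracing
# AD has no whole-array rule to apply, mutation breaks differentiation, and
# a GPU can't run the loop as a kernel.
function sumsq_loop(x)
    out = zeros(eltype(x), size(x))
    for i in eachindex(x)
        out[i] = x[i]^2
    end
    return sum(out)
end

# Whole-array version: one broadcast plus one reduction, so the existing AD
# rules for `.^` and `sum` provide the gradient for free, and the same code
# maps onto GPU arrays.
sumsq_vec(x) = sum(x .^ 2)

x = rand(Float32, 10)
Flux.Tracker.gradient(sumsq_vec, x)[1]   # == 2 .* x
# Flux.Tracker.gradient(sumsq_loop, x) fails on the mutation above.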

@jekbradbury (Contributor)

Looks like LRN is a fairly involved operation to write with purely vectorized ops (although it would be straightforward with something like Tensor Comprehensions, and there might be a way to leverage a convolution operation to perform the local summation).

@jekbradbury (Contributor)

Here's the best reference I can find for the manually-written gradient of LRN:
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/compiler/tf2xla/kernels/lrn_ops.cc#L120-L137
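
For orientation, the closed-form Jacobian that reference implements, derived from the definition above (S_i is the channel window around channel i):

d_i = k + \alpha \sum_{j \in S_i} a_j^2, \qquad
\frac{\partial b_i}{\partial a_l} = \frac{\delta_{il}}{d_i^{\beta}} - \frac{2 \alpha \beta\, a_i\, a_l}{d_i^{\beta+1}} \,\mathbb{1}[l \in S_i]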

@staticfloat (Contributor) commented Feb 27, 2019

I think we can rewrite this in terms of convolution. We are basically doing a 1x1 convolution whose kernel sums the LRN.n squared channels centered on each output channel. We should be able to do something like:

C_in = size(x, 3)

# Build 1x1 convolutional kernel that sums together the `LRN.n` channels centered around each output channel.
# We don't need to build this every time, it's constant, so we can do this at layer initialization time.
w = zeros(eltype(x), 1, 1, C_in, C_in)
for idx in 1:C_in
    w[1, 1, clamp(idx - div(LRN.n,2), 1, C_in):clamp(idx + div(LRN.n,2), 1, C_in), idx] .= 1.0
end

# Perform this convolution on the squared inputs
a = NNlib.conv(x.^2, w)
# The denominator needs the LRN.beta exponent from the paper's definition
return x ./ (LRN.k .+ LRN.alpha .* a) .^ LRN.beta

This should make the backward pass much easier for the compiler to figure out, and is also probably more efficient than what you have hand-written.
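
Put together, a minimal sketch of what such a layer could look like; the LRNorm field names and constructor here are assumptions for illustration, not the PR's actual code:

using NNlib

# Hypothetical LRNorm layer built on the conv trick above; the fields mirror
# the four hyperparameters from the ImageNet paper.
struct LRNorm{T}
    n::Int      # window size (number of adjacent channels summed)
    k::T        # additive constant
    alpha::T    # scale on the windowed sum
    beta::T     # exponent on the denominator
end

function (LRN::LRNorm)(x)
    C_in = size(x, 3)
    # 1x1 kernel that sums the LRN.n channels centered on each output channel.
    w = zeros(eltype(x), 1, 1, C_in, C_in)
    half = div(LRN.n, 2)
    for idx in 1:C_in
        w[1, 1, max(idx - half, 1):min(idx + half, C_in), idx] .= 1
    end
    a = NNlib.conv(x .^ 2, w)            # windowed sum of squares per channel
    return x ./ (LRN.k .+ LRN.alpha .* a) .^ LRN.beta
end

# Usage: normalize a WHCN batch across a 5-channel window, with the
# AlexNet-style constants k=2, alpha=1e-4, beta=0.75.
x = rand(Float32, 8, 8, 16, 4)
y = LRNorm(5, 2.0f0, 1.0f-4, 0.75f0)(x)
size(y) == size(x)   # true; output shape is preserved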

@CarloLucibello mentioned this pull request Dec 27, 2020
@CarloLucibello (Member)

Closing as very old, as part of housecleaning.
