Add support for lecun normal weight initialization #2290
Comments
Could probably use the existing
would be interested in solving this if it's still open. Please elaborate if it is.
In
Hi @vortex73, would you be working on this?
@chiral-carbon Please go ahead and open a PR if you are willing to tackle this.
@darsnack thanks, will open a PR soon
@RohitRathore1 were you working on this? I had a PR in the works but will stop @darsnack
Hi @chiral-carbon, @darsnack. Is this issue still unclaimed? Can I start?
@Bhavay-2001 I had claimed this, but then a PR was opened soon after by someone else, so I'm not sure about the status. If this issue opens up again for a new PR, I would like to work on it.
Hi @chiral-carbon, I don't know. I opened a PR but haven't received any comments on it, and when I now check the logs of one of the GitHub Actions runs, the logs are no longer available. I will have to review it again.
I don't recall why that PR didn't get comments. Maybe someone was waiting for tests to be in place? Anyhow, that would be my feedback now. We can continue on the PR thread :) |
Motivation and description
LeCun normal initialization is needed (as far as I understand) to properly build self-normalizing neural networks.
Since Flux already provides the selu activation function and alpha dropout, it would be nice to have LeCun normal initialization built in as well.
Possible Implementation
Draws samples from a truncated normal distribution centered on 0 with stddev = sqrt(1 / fan_in), where fan_in is the number of input units in the weight tensor. (That description is from the TensorFlow documentation.)
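To make the description above concrete, here is a minimal sketch in plain Python. This is not Flux's actual implementation; the function name `lecun_normal` and the ±2σ truncation bound are assumptions, following the common convention used by TensorFlow's truncated-normal initializers:

```python
import math
import random

def lecun_normal(fan_in, fan_out, rng=random):
    """Sample a fan_out x fan_in weight matrix from a truncated normal
    distribution with mean 0 and stddev sqrt(1 / fan_in).

    Samples are truncated at +/- 2 standard deviations (an assumed
    convention; TensorFlow's truncated-normal initializers do the same)
    via simple rejection sampling.
    """
    std = math.sqrt(1.0 / fan_in)

    def draw():
        # Redraw until the sample falls within 2 stddevs of the mean.
        while True:
            x = rng.gauss(0.0, std)
            if abs(x) <= 2.0 * std:
                return x

    return [[draw() for _ in range(fan_in)] for _ in range(fan_out)]

# Example: weights for a layer with 100 inputs and 50 outputs.
W = lecun_normal(100, 50)
```

Because stddev shrinks as 1 / sqrt(fan_in), the variance of each layer's pre-activations stays roughly constant regardless of layer width, which is what the SELU self-normalization argument relies on.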