Sometimes it can be hard to know the input data's scale, and therefore hard to standardise it (as in a UDE, where the network's inputs are evolving model states rather than a fixed dataset). It might then make sense to leave the first layer's parameters unregularised, or only weakly regularised, so they can compensate for differences in scale between the inputs. Something like `FrontMiddleLastPenalty`, although that's getting a bit verbose.
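For concreteness, a minimal sketch of what such a wrapper could look like, assuming a penalty is any callable mapping a layer's parameters to a scalar; the `l2` and `nopenalty` helpers are made up for illustration, not the package's API:

```julia
# Hypothetical helpers: a penalty is any callable returning a scalar.
l2(λ) = ps -> λ * sum(abs2, ps)
nopenalty(ps) = zero(eltype(ps))

struct FrontMiddleLastPenalty{F,M,L}
    front::F   # applied to the first layer's parameters
    middle::M  # applied to every interior layer
    last::L    # applied to the final layer's parameters
end

# Sum the three parts over an indexable collection of per-layer parameters.
(p::FrontMiddleLastPenalty)(layers) =
    p.front(first(layers)) +
    sum(p.middle, layers[2:(end - 1)]; init = 0.0) +
    p.last(last(layers))

# Leave the first layer unregularised so it can absorb input-scale differences.
penalty = FrontMiddleLastPenalty(nopenalty, l2(1e-3), l2(1e-3))
layers = [randn(4, 3), randn(4, 4), randn(1, 4)]
penalty(layers)
```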
As well as a `NonBiasPenalty` that doesn't get applied to the bias parameters.
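A sketch of that idea, assuming each layer's parameters arrive as a NamedTuple with `weight` and `bias` fields (that layout, and the wrapper name, are assumptions rather than the package's actual interface):

```julia
# Wrap any penalty so it only ever sees the weight array.
struct NonBiasPenalty{P}
    penalty::P
end

# The bias never contributes to the regularisation term.
(p::NonBiasPenalty)(layer) = p.penalty(layer.weight)

l2(λ) = ps -> λ * sum(abs2, ps)  # hypothetical helper penalty
penalty = NonBiasPenalty(l2(1e-3))
layer = (weight = randn(4, 3), bias = randn(4))
penalty(layer)  # only layer.weight is penalised
```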
Would it be better to have more penalty wrappers, as with the existing `FrontLastPenalty`, or would it be more natural to make the bias-regularisation toggle a type parameter of `L1Penalty` and `L2Penalty`?
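To make the comparison concrete, here is a sketch of the type-parameter variant, under the same assumed `weight`/`bias` NamedTuple layout; the keyword constructor and Boolean parameter are illustrative, not a committed design:

```julia
# Encode the toggle as a Boolean type parameter, resolved by dispatch,
# with a keyword constructor for ergonomics.
struct L2Penalty{bias}
    λ::Float64
end
L2Penalty(λ; bias::Bool = true) = L2Penalty{bias}(λ)

# With the bias included, penalise weights and biases together...
(p::L2Penalty{true})(layer) = p.λ * (sum(abs2, layer.weight) + sum(abs2, layer.bias))
# ...without it, penalise the weights only.
(p::L2Penalty{false})(layer) = p.λ * sum(abs2, layer.weight)

layer = (weight = randn(4, 3), bias = randn(4))
L2Penalty(1e-3)(layer)                # biases regularised (default)
L2Penalty(1e-3; bias = false)(layer)  # biases skipped
```

The type-parameter route avoids an extra wrapper type, while the wrapper composes with any inner penalty; either way the default behaviour stays unchanged.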