Issue with changing activation functions #31

rnagurla · 2019-05-09T18:06:17Z

I was wondering how to change the default sigmoid activation function to something else. I've tried changing it to tanh and it's not working. I've also tried using the linear activation function on the examples given and it's failing that as well

codeplea · 2019-05-09T18:34:35Z

You can set activation_hidden and activation_output. However, a linear activation function is not able to solve a non-linear problem, such as xor used in the examples.

rnagurla · 2019-05-09T18:58:43Z

Tanh and ReLU are non linear activation functions right? So I should be able to use those two functions for the examples. However, when I run using those functions it doesn't pass some of the examples. I'm assuming its because the ranges are different than sigmoid. Would I have to change anything in the source code to allow it use tanh or relu?

msrdinesh · 2019-12-09T02:52:40Z

Actually, in the code, the backpropagation algorithm is written only for the sigmoid activation function. We have to change the code for any generic activation function. If no one is working, I can work on this.
similar discussion check here

codeplea · 2019-12-09T04:07:14Z

Yes, back-propagation is only implemented for sigmoid. Other training methods can still work with other activation functions. If back-prop is needed, it'll need to be implemented.

msrdinesh · 2019-12-10T06:41:05Z

Hey @codeplea can I work on this issue? I would like to add back prop for tanh and relu activation functions. If no one else is working on this, pls assign me this issue.

codeplea · 2019-12-10T15:18:23Z

@msrdinesh Sure. Give it a go. Just please keep it short and simple. I think you can mirror the way that output and hidden activation functions are used.

msrdinesh · 2019-12-10T16:53:20Z

Ok, I will do it. Thanks.

mu578 · 2020-09-28T16:15:00Z

@msrdinesh, @codeplea hello any follow up on that matter? Have a good day.

ScratchyCode · 2021-02-23T20:03:26Z

I'm waiting too for update about changing the activation function :)

lucasart · 2021-04-11T11:09:40Z

@moe123 @ScratchyCode It's trivial to adapt backprop to any function you want. Read this, preferably with a pen and paper, redoing the calculation on your own until it becomes crystal clear.

mu578 · 2021-04-11T13:32:23Z

@lucasart computing the derivative is not the problem, the problem is to have a redesign of the code that reflects the current activation function, so something needs to be known and pass along: a state. We can all patch dirty; we already all do; however, we would prefer a clean redesigned approach to support this option + would let the opportunity to run several instances set up differently without tweaking and stirring the code. When you start maintaining third-party forks and patches, it's already too much. I think we all have a float-single version running on an approx of the exp function somewhere.

lucasart · 2021-04-12T02:37:29Z

I wrote my own nn library library, if anyone's interested.

Same functionality as genann. Also uses a flat memory layout for weights+neurons+delta (great for cache efficiency and use with more advanced gradient optimisations methods, so user code can directly adress the weights vector).

But also better, because:

more flexible: hidden layers can each have different number of neurons. error function can be absolute or quadratic (absolute makes more sense than quadratic in a lot of real applications).
cleaner code base: reduces indexing hell by using a layer structure (which points to the right location in the flat array).
trivial to add your own activation functions, without having to touch the backprop code.

mu578 · 2021-04-16T00:51:21Z

@lucasart ; the implementation is interesting; meanwhile, I would go deeper, adding a layer of indirection on any internal arithmetic operations then moving nn_float_t to nn_numeric_t or so ; thus, you'd give the choice to interface with a half-float extension or fixed point representation to the end-user. To note, most people will not be so confortable with your licensing choice even academics.

pjvm742 mentioned this issue Sep 14, 2023

backpropagation for (some) user-defined activation functions #58

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with changing activation functions #31

Issue with changing activation functions #31

rnagurla commented May 9, 2019 •

edited

Loading

codeplea commented May 9, 2019

rnagurla commented May 9, 2019

msrdinesh commented Dec 9, 2019 •

edited

Loading

codeplea commented Dec 9, 2019

msrdinesh commented Dec 10, 2019

codeplea commented Dec 10, 2019

msrdinesh commented Dec 10, 2019 •

edited

Loading

mu578 commented Sep 28, 2020

ScratchyCode commented Feb 23, 2021

lucasart commented Apr 11, 2021

mu578 commented Apr 11, 2021 •

edited

Loading

lucasart commented Apr 12, 2021 •

edited

Loading

mu578 commented Apr 16, 2021

Issue with changing activation functions #31

Issue with changing activation functions #31

Comments

rnagurla commented May 9, 2019 • edited Loading

codeplea commented May 9, 2019

rnagurla commented May 9, 2019

msrdinesh commented Dec 9, 2019 • edited Loading

codeplea commented Dec 9, 2019

msrdinesh commented Dec 10, 2019

codeplea commented Dec 10, 2019

msrdinesh commented Dec 10, 2019 • edited Loading

mu578 commented Sep 28, 2020

ScratchyCode commented Feb 23, 2021

lucasart commented Apr 11, 2021

mu578 commented Apr 11, 2021 • edited Loading

lucasart commented Apr 12, 2021 • edited Loading

mu578 commented Apr 16, 2021

rnagurla commented May 9, 2019 •

edited

Loading

msrdinesh commented Dec 9, 2019 •

edited

Loading

msrdinesh commented Dec 10, 2019 •

edited

Loading

mu578 commented Apr 11, 2021 •

edited

Loading

lucasart commented Apr 12, 2021 •

edited

Loading