Using tanh vs. sigmoid for output layer

paulinpaloalto · October 19, 2022, 3:28pm

Yes, I guess you could think about doing it that way, but that’s not the only thing you have to deal with, right? How do you define your loss function in that case? With the sigmoid outputs looking like probabilities, that gives “cross entropy” as the natural loss function.
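
To make the loss function point concrete, here is a minimal sketch (not the course's code; the helper names `sigmoid` and `binary_cross_entropy` are just illustrative) of what happens if you feed a raw tanh output into the usual binary cross entropy formula. Since tanh ranges over (-1, 1), the log can hit a negative argument and the loss is simply undefined:

```python
import math

def sigmoid(z):
    # Logistic sigmoid: output is always in (0, 1).
    return 1.0 / (1.0 + math.exp(-z))

def binary_cross_entropy(y, a):
    # Standard binary cross entropy for label y in {0, 1}
    # and prediction a, which must lie strictly in (0, 1).
    return -(y * math.log(a) + (1 - y) * math.log(1 - a))

z = -0.5                   # an arbitrary example pre-activation value
a_sigmoid = sigmoid(z)     # in (0, 1), usable as a probability
a_tanh = math.tanh(z)      # negative here, so not a probability

loss_sigmoid = binary_cross_entropy(1, a_sigmoid)

try:
    binary_cross_entropy(1, a_tanh)
    tanh_loss_defined = True
except ValueError:
    # math.log raises ValueError for a negative argument
    tanh_loss_defined = False
```

So with tanh at the output you would either need a different loss function or you would have to rescale the output into (0, 1) first.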

But another way to ask the question is: why do you think your method would be better? Also note that tanh and sigmoid turn out to be very closely related: tanh is just a scaled and shifted version of sigmoid.
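
If you want to see the relationship for yourself, the identity is tanh(z) = 2 * sigmoid(2z) - 1, which follows from multiplying the numerator and denominator of tanh by e^(-z). Here is a quick numerical check, as a sketch with a hypothetical helper name `tanh_via_sigmoid`:

```python
import math

def sigmoid(z):
    # Logistic sigmoid: 1 / (1 + e^(-z)).
    return 1.0 / (1.0 + math.exp(-z))

def tanh_via_sigmoid(z):
    # tanh written as a scaled and shifted sigmoid.
    return 2.0 * sigmoid(2.0 * z) - 1.0

# Compare against the library tanh at a few arbitrary sample points.
sample_points = [-3.0, -1.0, 0.0, 0.5, 2.0]
max_gap = max(abs(math.tanh(z) - tanh_via_sigmoid(z)) for z in sample_points)
```

The gap is at the level of floating point rounding, so switching the output to tanh mostly amounts to relabeling the output range from (0, 1) to (-1, 1).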
