Why is the sigmoid activation function better for binary classification than the tanh activation function?

What cost function would you use if tanh is your output activation? The cross-entropy log loss we normally use can't handle outputs outside the range 0 to 1: it takes the log of the prediction and of one minus the prediction, so it only makes sense when the output can be read as a probability.
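As a quick sanity check, here is a minimal sketch (using only the standard library; `binary_cross_entropy` is a hypothetical helper written for this illustration) showing that the log loss is well defined for a sigmoid-style output in (0, 1) but blows up on a negative tanh-style output:

```python
import math

def binary_cross_entropy(y_true, y_pred):
    """Cross-entropy log loss for a single example: -[y*log(p) + (1-y)*log(1-p)]."""
    return -(y_true * math.log(y_pred) + (1 - y_true) * math.log(1 - y_pred))

# A sigmoid output is a valid probability, so the loss is well defined.
print(binary_cross_entropy(1, 0.8))  # -log(0.8), a finite positive loss

# A tanh output can be negative, and the log of a negative number is undefined.
try:
    binary_cross_entropy(1, -0.5)
except ValueError as e:
    print("tanh-style output breaks the loss:", e)
```

The `ValueError` from `math.log` is the concrete symptom: the loss simply isn't defined unless the output lives in (0, 1).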

If your response is, "well, we could shift and scale tanh to have the range (0, 1)," then guess what? It turns out tanh and sigmoid are very closely related mathematically: tanh(x) = 2·sigmoid(2x) − 1, so the shifted-and-scaled tanh is just a sigmoid with a rescaled input, and you don't really gain any advantage from that strategy.
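You can verify this equivalence numerically; this sketch (standard library only, with `scaled_tanh` defined here just for the comparison) checks that (tanh(x) + 1) / 2 equals sigmoid(2x) at several points:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def scaled_tanh(x):
    # tanh shifted and scaled into the range (0, 1)
    return (math.tanh(x) + 1.0) / 2.0

# (tanh(x) + 1) / 2 == sigmoid(2x) for every input:
for x in [-3.0, -0.5, 0.0, 1.2, 4.0]:
    assert abs(scaled_tanh(x) - sigmoid(2 * x)) < 1e-12
print("scaled tanh is just sigmoid with a rescaled input")
```

Since the input scale is something the preceding layer's weights can absorb during training, the rescaled network is equivalent to one that used sigmoid in the first place.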