Is Tanh better than sigmoid?

paulinpaloalto · May 6, 2023, 3:56pm

In a related question, sometimes people ask if we could even use tanh as the activation at the output layer in a binary classification and then use >= 0 as “Yes” and < 0 as “No”. But the question then is what you would use as a loss function, since cross entropy loss is specifically designed to work with sigmoid. Here’s a thread which discusses that in more detail and also ends up showing that there is a very close relationship mathematically between sigmoid and tanh.

Topic		Replies	Views
Course 1 : sigmoid vs tanh function Neural Networks and Deep Learning coursera-platform	2	674	August 23, 2021
Why is sigmoid activation function better for binary classification than the tanh activation function Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	703	September 21, 2021
Better Activation functions: (tanh > sigmoid) MLS Resources	18	1451	November 10, 2022
Tanh and sigmoid are closely related Neural Networks and Deep Learning coursera-platform	3	968	March 3, 2022
Why not use tanh-func for output a^L? Neural Networks and Deep Learning coursera-platform	1	518	August 5, 2021

Is Tanh better than sigmoid?

Related topics