Course 1 : sigmoid vs tanh function

khirman · August 21, 2021, 7:29pm

kindly explain in simple words why tanh is better than sigmoid function?

khirman · August 21, 2021, 8:37pm

Mean of your data closer to 0 rather than 0.5 and make learning easy for next layer. kindly elaborate this point of instructor .

kenb · August 23, 2021, 1:44pm

The sigmoid function is useful as the output layer in classification tasks, in which one tries to ascertain the probability of an object (e.g. an image) belonging to a specific class (e.g. a cat). In terms of probability, it is a valid “cumulative density function.” Specifically, it is monotonically increasing in the range between zero and one. As such, its output can be interpreted as a probability. It is important to note that it evaluates to 0.5 at 0.

The tanh has the same “S-shape” as the sigmoid, but it ranges between -1 and 1 and evaluates to 0 at “Z=0.” This is a useful property for the hidden layers as it is usually the case that the data and the inputs are normalized so that they have mean zero and a unit standard deviation (for a number of reasons which will become clear in the second course).

The S-shape implies that as the linear activation Z, evaluates further from zero, the activation becomes stronger–either negatively or positively–and the gradient becomes smaller.

Topic		Replies	Views
Is Tanh better than sigmoid? Neural Networks and Deep Learning	5	658	May 11, 2023
Why not use tanh-func for output a^L? Neural Networks and Deep Learning	1	511	August 5, 2021
Using tanh vs. sigmoid for output layer Neural Networks and Deep Learning	6	770	October 20, 2022
Better Activation functions: (tanh > sigmoid) MLS Resources	18	941	November 10, 2022
Why is sigmoid activation function better for binary classification than the tanh activation function Improving Deep Neural Networks: Hyperparameter tun	2	679	September 21, 2021

Course 1 : sigmoid vs tanh function

Related topics