ReLU activation function vs sigmoid function

Is it possible to use the ReLU activation function for the output layer rather than the sigmoid function?

The sigmoid function returns a non-zero value even for negative inputs, whereas ReLU maps them to zero. If you don’t expect the output of the layer before activation to be negative, ReLU should be preferred.
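
A minimal sketch of that difference, assuming NumPy (the values in the comments are rounded):

```python
import numpy as np

def sigmoid(z):
    # Sigmoid squashes any real input into (0, 1); it never reaches exactly 0.
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    # ReLU clips negative inputs to exactly 0 and passes positives through.
    return np.maximum(0.0, z)

z = np.array([-3.0, -0.5, 0.0, 0.5, 3.0])
print(np.round(sigmoid(z), 3))  # [0.047 0.378 0.5   0.622 0.953] -> always strictly positive
print(relu(z))                  # [0.  0.  0.  0.5 3. ]           -> exact zeros for z <= 0
```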


Since the two have different characteristics, we usually select the one that better fits the objective.
As you know, the output of the sigmoid curve is between 0 and 1. That is good for converting a broader range of data into this “easy-to-understand” range. But, of course, there are some drawbacks. If an input value is very large or very small, then the gradient, which is one of the most important quantities in deep learning, essentially disappears. The maximum value of the first-order derivative (the gradient) is only 0.25, so convergence may be slow.
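
A minimal sketch, assuming NumPy, of how small the sigmoid gradient gets (the printed values are approximate):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_grad(z):
    # sigma'(z) = sigma(z) * (1 - sigma(z)); its maximum is 0.25 at z = 0.
    s = sigmoid(z)
    return s * (1.0 - s)

z = np.array([-10.0, -5.0, 0.0, 5.0, 10.0])
print(sigmoid_grad(z))
# ~[4.5e-05 6.6e-03 2.5e-01 6.6e-03 4.5e-05]
# The gradient peaks at 0.25 (z = 0) and shrinks toward zero for large |z|,
# which is the vanishing-gradient issue described above.
```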
ReLU has, as you know, quite different characteristics. It clips negative values to zero. If negative values are important in your problem (like temperature), it may not work. On the other hand, for positive inputs it works well, especially in hidden layers, since the gradient is constant, which helps reduce the computational effort. (You will see lots of derivatives and partial derivatives in back-propagation.) But, of course, this is also a drawback, since the gradient is always 0 for negative values.
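
And a corresponding sketch for the ReLU gradient, again assuming NumPy:

```python
import numpy as np

def relu_grad(z):
    # The ReLU gradient is 1 for z > 0 and 0 for z < 0
    # (it is undefined at z = 0; implementations typically pick 0 or 1 there).
    return (z > 0).astype(float)

z = np.array([-2.0, -0.1, 0.5, 4.0])
print(relu_grad(z))  # [0. 0. 1. 1.]
# A constant gradient of 1 for positive inputs keeps back-propagation cheap and
# avoids vanishing gradients, but a unit whose inputs stay negative receives
# gradient 0 and stops learning.
```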
So we need to choose the right one for the task; it is not a simple drop-in replacement.
