Suppose you take the same cost function and the same architecture and train two separate models, one using ReLU and the other using tanh for the hidden layers, for enough iterations (assuming the output activation function is the same for both):
Would you still end up with the same parameters in the final model because the cost function is the same, or does the choice of hidden-layer activations affect the final parameters?
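To make the setup concrete, here is a minimal sketch of the experiment being asked about: two networks with identical initial weights and the same mean-squared-error cost, differing only in the hidden activation, trained with full-batch gradient descent on toy data. Everything here (the data, the one-hidden-layer architecture, the `train` helper) is a hypothetical illustration, not a reference implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression data, shared by both models
X = rng.normal(size=(200, 3))
y = np.sin(X[:, :1]) + 0.1 * rng.normal(size=(200, 1))

def train(activation, activation_grad, steps=5000, lr=0.05):
    # Same seed -> identical initial parameters for both runs
    w_rng = np.random.default_rng(42)
    W1 = w_rng.normal(scale=0.5, size=(3, 8))
    b1 = np.zeros((1, 8))
    W2 = w_rng.normal(scale=0.5, size=(8, 1))
    b2 = np.zeros((1, 1))
    for _ in range(steps):
        # Forward pass: one hidden layer, linear output
        z1 = X @ W1 + b1
        a1 = activation(z1)
        yhat = a1 @ W2 + b2
        # Gradient of the MSE cost w.r.t. the output
        grad_yhat = 2 * (yhat - y) / len(X)
        # Backward pass
        gW2 = a1.T @ grad_yhat
        gb2 = grad_yhat.sum(axis=0, keepdims=True)
        gz1 = (grad_yhat @ W2.T) * activation_grad(z1)
        gW1 = X.T @ gz1
        gb1 = gz1.sum(axis=0, keepdims=True)
        # Gradient-descent updates
        W1 -= lr * gW1; b1 -= lr * gb1
        W2 -= lr * gW2; b2 -= lr * gb2
    return W1, W2

relu = lambda z: np.maximum(z, 0)
relu_grad = lambda z: (z > 0).astype(float)
tanh_grad = lambda z: 1 - np.tanh(z) ** 2

W1_relu, W2_relu = train(relu, relu_grad)
W1_tanh, W2_tanh = train(np.tanh, tanh_grad)

# Compare the learned hidden-layer weights: any nonzero difference
# means the activation choice changed the final parameters
print(np.abs(W1_relu - W1_tanh).max())
```

Only the hidden activation and its derivative differ between the two calls, so any difference in the printed value comes from that choice alone.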