Hi,
I have a (not so) basic question: why do we need activation functions?
And in a multilayer NN, how do we choose a “good” activation function for each layer?
In a video Andrew said:
sigmoid only for the output layer of a binary classification
tanh is superior if the data are “normalized” (I don’t remember the exact word)
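The word was probably “centered”: unlike sigmoid, tanh produces outputs centered around zero. Here is a minimal NumPy sketch (my own illustration, not from the course) contrasting the two on pre-activations drawn around zero:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Pre-activations roughly centered at zero, as you'd expect with normalized inputs
z = np.random.randn(10_000)

# sigmoid squashes to (0, 1), so its outputs average around 0.5;
# tanh squashes to (-1, 1), so its outputs stay centered near 0,
# which tends to make learning easier for the next layer
print("sigmoid mean:", sigmoid(z).mean())  # ~0.5
print("tanh mean:   ", np.tanh(z).mean())  # ~0.0
```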
Rashmi has covered your question, but it might also be worth having a look at this thread, which talks a bit more about how to approach choosing the activation functions for the hidden layers.
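On the “why do we need activation functions” part, a short NumPy sketch (illustrative, with made-up weights) makes the standard argument concrete: without a nonlinearity between layers, a stack of linear layers collapses into a single linear layer, so the extra depth adds no expressive power:

```python
import numpy as np

rng = np.random.default_rng(0)

# Two "layers" with no activation function in between
W1, b1 = rng.standard_normal((4, 3)), rng.standard_normal(4)
W2, b2 = rng.standard_normal((2, 4)), rng.standard_normal(2)

x = rng.standard_normal(3)

# Forward pass through both linear layers
out_two_layers = W2 @ (W1 @ x + b1) + b2

# The same map collapses to a single linear layer W x + b
W = W2 @ W1
b = W2 @ b1 + b2
out_one_layer = W @ x + b

print(np.allclose(out_two_layers, out_one_layer))  # True: depth bought nothing

# Inserting any nonlinearity (e.g. tanh) between the layers breaks this collapse
out_nonlinear = W2 @ np.tanh(W1 @ x + b1) + b2
```

That collapse is exactly what a nonlinear activation prevents: with tanh, ReLU, or sigmoid between the layers, the network can represent non-linear functions that no single linear layer could.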