Leaky ReLU is the function max(0.01z, z). Is the 0.01 value always fixed, or do we need to tune it using a cross-validation set?
I ask because in the Activation Functions lecture video, Prof. Andrew Ng says something like: "And you might say, why is that constant 0.01? Well, you can also make that another parameter of the learning algorithm."
What does "another parameter of the learning algorithm" mean here?
Hi @Anbu, I think Andrew Ng meant another hyperparameter of the learning algorithm: you could try different values for that slope and check which one works best for your particular case during the training phase.
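To make that concrete, here is a minimal sketch (plain NumPy, names and candidate values are just for illustration) of treating the slope as a tunable value rather than a fixed constant:

```python
import numpy as np

def leaky_relu(z, alpha=0.01):
    """Leaky ReLU: max(alpha * z, z), assuming alpha < 1."""
    return np.maximum(alpha * z, z)

z = np.array([-2.0, -0.5, 0.0, 1.0, 3.0])
# candidate slopes you might compare on a validation set
for alpha in (0.01, 0.1, 0.3):
    print(alpha, leaky_relu(z, alpha))
```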
Note that different deep learning frameworks use different default values for LeakyReLU: in PyTorch, for example, the default is 0.01, while in TensorFlow it is 0.3. Both frameworks allow you to specify the value manually, so you could treat it as a hyperparameter if you want to play with it.
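For reference, this is roughly how you would override the default slope in each framework (argument names as of recent PyTorch and TensorFlow 2.x releases; newer Keras versions rename `alpha` to `negative_slope`, so check your version's docs):

```python
import torch
import tensorflow as tf

# PyTorch: default negative_slope is 0.01; set it explicitly here
leaky_pt = torch.nn.LeakyReLU(negative_slope=0.2)

# TensorFlow/Keras: default alpha is 0.3; set it explicitly here
leaky_tf = tf.keras.layers.LeakyReLU(alpha=0.2)

x_pt = torch.tensor([-1.0, 0.0, 2.0])
x_tf = tf.constant([-1.0, 0.0, 2.0])
print(leaky_pt(x_pt))  # roughly [-0.2, 0.0, 2.0]
print(leaky_tf(x_tf))  # roughly [-0.2, 0.0, 2.0]
```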
I notice in the TensorFlow documentation that both relu and LeakyReLU accept an alpha parameter. Per the documentation, the default for relu is alpha=0.0, while the default for LeakyReLU is alpha=0.3.
My reading of the parameter explanation suggests that using relu with alpha=0.3 is the same as using LeakyReLU with alpha=0.3 (its default). Am I missing something? Why have both if you can just set the parameter in relu to accomplish the same thing?
Here are the relevant passages:
relu: With default values, this returns the standard ReLU activation: max(x, 0), the element-wise maximum of 0 and the input tensor.
Modifying default parameters allows you to use non-zero thresholds, change the max value of the activation, and to use a non-zero multiple of the input for values below the threshold.
From a brief hunt through the source code on GitHub, it looks like under the covers leaky_relu and relu end up calling the same lower-level code. They are both just wrappers that pass through different parameters/defaults.
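As a quick sanity check of that equivalence (assuming a TF 2.x install where tf.keras.activations.relu still takes an alpha argument), the two produce the same output:

```python
import tensorflow as tf

x = tf.constant([-3.0, -1.0, 0.0, 2.0])

# relu with a non-zero alpha multiplies negative inputs by alpha...
out_relu = tf.keras.activations.relu(x, alpha=0.3)

# ...which is exactly what the LeakyReLU layer does with the same alpha
out_leaky = tf.keras.layers.LeakyReLU(alpha=0.3)(x)

print(out_relu.numpy())   # [-0.9 -0.3  0.   2. ]
print(out_leaky.numpy())  # [-0.9 -0.3  0.   2. ]
```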