Leaky RELU Activation Function

Anbu · May 18, 2021, 11:13am

Hi Mentor,

Is this topic leaky Relu will be covered detailed in upcoming course?
Around lecture video from 8:18, the below statement does it mean like we need to choose paramter for ReLu function based on which paramter provides good accuracy , choose it. Does the below statement meaning it ?

And you might say, why is that constant 0.01? Well, you can also make that another parameter of the learning algorithm.

Anbu · May 25, 2021, 10:59am

Can someone please make to understand this doubt ? Is it always fixed 0.01 value for ReLU Activation function or should we use cross validation to find best value ?

max(0.01 * z, z)

Jaskeerat · May 25, 2021, 11:13am

Hi, 0.01 is not a hyperparameter/something we change a lot because it is not really that important. Leaky-ReLU is not seen very frequently in practice either. The 0.01 is just so that we don’t completely remove the negative value if that is something that would be important to the scenario one is working on. But for most cases, regular ReLU works wonderfully well, and even in the leaky relu variation, changing the constant does not make a big difference to performance, hence it is not considered that important. There is no need to make it a parameter of the learning algorithm.

Topic		Replies	Views
Leaky RELU Activation Function Clarification Neural Networks and Deep Learning coursera-platform	3	718	October 3, 2021
Why more people used ReLU even though Leaky RelU is better? Neural Networks and Deep Learning coursera-platform	1	522	January 12, 2022
Activation functions as hyperparameters Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	567	September 14, 2021
Why Gen uses ReLU while Disc uses LeakyReLU? Build Basic Generative Adversarial Networks week-module-1	2	650	April 4, 2022
Can I know in what scenario one would prefer LeakyRelu than Relu activation function? Advanced Learning Algorithms week-module-1	10	432	September 12, 2023

Leaky RELU Activation Function

Related topics