Professor said that Leaky ReLU is slightly better than ReLU, but then why do people most frequently use ReLU? For ReLU, if the input is negative, don't the parameters end up not updating, because the derivative of ReLU is 0 there?
I had the same question, and I found this, which might be interesting to you as well. There's a paper that studied different types of leaky ReLU here. According to the paper, the three leaky ReLU variants studied consistently outperformed the original ReLU. However, the reasons for their superior performance still lack rigorous theoretical justification, and how these activations perform on large-scale data still needs to be investigated; maybe that's why ReLU is used more frequently. It's worth mentioning that the paper was published in 2015, and I'm not sure whether these problems have been solved since.
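If it helps, here's a minimal sketch (assuming PyTorch, not from the paper) of the gradient behaviour you're asking about: for a negative input, ReLU's gradient is 0, so the corresponding weights stop updating, while Leaky ReLU still passes a small gradient.

```python
import torch
import torch.nn.functional as F

# A single negative input so we can inspect the gradient directly
x = torch.tensor([-2.0], requires_grad=True)

# ReLU: output is 0 and the gradient w.r.t. x is 0 for negative inputs
y = torch.relu(x)
y.backward()
print(x.grad)  # tensor([0.])

x.grad = None  # reset the gradient before the next backward pass

# Leaky ReLU: output is negative_slope * x, so the gradient is negative_slope
y = F.leaky_relu(x, negative_slope=0.01)
y.backward()
print(x.grad)  # tensor([0.0100])
```

So with ReLU a unit whose input stays negative gets no gradient at all (the "dying ReLU" issue), whereas Leaky ReLU keeps a small gradient flowing.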