I know that this is just an off-hand example that was probably created on the fly, but I want to confirm my understanding: applying 2 ReLU activation functions directly in a row is equivalent to applying just one, right? So in a real deep neural network, we would most likely not do this.
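That part of the understanding is correct: ReLU is idempotent, so applying it twice back-to-back (with nothing in between) gives the same result as applying it once. A quick numeric check, using a plain scalar ReLU for simplicity:

```python
def relu(z):
    # ReLU clamps negative inputs to zero and passes non-negatives through
    return max(0.0, z)

# ReLU applied twice equals ReLU applied once, for any input
for z in [-2.0, -0.5, 0.0, 0.5, 3.0]:
    assert relu(relu(z)) == relu(z)
print("ReLU(ReLU(z)) == ReLU(z) for all tested z")
```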

However, that is NOT what the slide is showing. Each box in the slide represents a Dense layer, and writing "ReLU" in a box means that the Dense layer uses a ReLU activation. So the slide is actually talking about ReLU( W^{[2]} \, ReLU( W^{[1]} x + b^{[1]}) + b^{[2]}): there is a linear transformation between the two ReLUs.

But W^{[1]} x + b^{[1]} is also a linear transformation on x, right? So without the inner ReLU it would be two nested linear transformations, which collapse into a single one. Wouldn't that make it equivalent to using ReLU in one node?

No, because the inner ReLU changes the value before the second linear transformation. For example, if the inner W^{[1]} x + b^{[1]} is -1, and W^{[2]} is -1 and b^{[2]} is 0, then the inner ReLU(-1) is 0, so the whole expression becomes ReLU( -1 \times 0 + 0) = ReLU(0) = 0.

If we take away the inner ReLU, so it becomes ReLU( W^{[2]} ( W^{[1]} x + b^{[1]}) + b^{[2]}), then the answer changes to ReLU( -1 \times -1 + 0) = ReLU(1) = 1.

So, if we take away the inner ReLU, we are computing a different function. In this case, the two are not equivalent.
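The counterexample above can be checked directly. The sketch below plugs in the same made-up numbers (inner pre-activation = -1, W^{[2]} = -1, b^{[2]} = 0):

```python
def relu(z):
    return max(0.0, z)

# Values from the worked example: W1*x + b1 = -1, W2 = -1, b2 = 0
inner = -1.0      # the inner pre-activation W1*x + b1
W2, b2 = -1.0, 0.0

with_inner_relu = relu(W2 * relu(inner) + b2)   # ReLU(-1 * 0 + 0) = 0
without_inner_relu = relu(W2 * inner + b2)      # ReLU(-1 * -1 + 0) = 1

print(with_inner_relu, without_inner_relu)  # → 0.0 1.0
```

Two different outputs for the same input, so the inner ReLU genuinely matters.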