Why is ReLU used in hidden layers?

If ReLU is used in hidden layers, the output of those layers will always be non-negative. How can that be correct for regression models?
If the hidden layers output only 0 and positive numbers, then how can the final output layer, whose input is only positive (or 0), give the correct answer?

Hey @Aishwarya_Mundley,

A quick response is, you can have negative weights in the output layer to make the output negative.

The incoming activation values from the last hidden layer are non-negative, but the weights in the output layer can be negative.

I have to go now. If you have any follow-up, I am sure other mentors who have time can answer you.

Cheers,
Raymond

As Raymond explained, the last hidden layer’s output (which is 0 or positive) feeds into the output layer, and the output layer can then produce negative numbers as well (because it uses a different activation, typically linear for regression, and its weights and bias can be negative). See the small sketch below.
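
For example, here is a minimal NumPy sketch with hand-picked weights (purely for illustration, not from the course): even though the ReLU hidden layer outputs only non-negative values, a linear output layer with negative weights can still produce a negative prediction.

```python
import numpy as np

# Toy network: one ReLU hidden layer feeding a linear output layer.
# Weights are chosen by hand just to illustrate the point.

x = np.array([2.0, -1.0])            # input features

# Hidden layer: ReLU keeps activations non-negative
W1 = np.array([[1.0, -0.5],
               [0.5,  1.0]])
b1 = np.array([0.0, 0.0])
h = np.maximum(0, W1 @ x + b1)       # [2.5, 0.0] -- all >= 0

# Output layer: linear activation; weights and bias may be negative
W2 = np.array([[-1.2, 0.7]])
b2 = np.array([-0.3])
y = W2 @ h + b2                      # -1.2*2.5 + 0.7*0.0 - 0.3 = -3.3

print(h)   # non-negative hidden activations
print(y)   # negative prediction is still possible
```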

Best,
Saif.


@Aishwarya_Mundley, that is also the reason why we generally avoid using ReLU in the output layer when the target values can be negative, as in many regression problems.