From my understanding, the ReLU function is the same as a linear function except that y = 0 for x <= 0. So it would more easily lead to high bias than high variance. Why do we have to set lambda (the regularization parameter) for it?
These dense layers have neuron units with weights and biases, i.e. (wx + b), and then an activation is applied. The regularization penalty acts on the weights themselves, not on the activation output: the layer still computes wx + b and passes that value through the ReLU, while the penalty on the weights is added to the loss.
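A minimal NumPy sketch of that idea (names like `dense_relu` and the lambda value 0.01 are illustrative, not from any specific framework): the forward pass is just wx + b followed by ReLU, and the L2 penalty is computed from the weight matrix alone.

```python
import numpy as np

def dense_relu(x, W, b):
    # Linear part wx + b, then the ReLU activation.
    z = x @ W + b
    return np.maximum(z, 0.0)

def l2_penalty(W, lam):
    # The regularization term depends only on the weights,
    # not on the activations; it is added to the training loss.
    return lam * np.sum(W ** 2)

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))   # 4 samples, 3 features
W = rng.normal(size=(3, 5))   # a dense layer with 5 units
b = np.zeros(5)

a = dense_relu(x, W, b)           # activations, shape (4, 5)
penalty = l2_penalty(W, lam=0.01) # scalar added to the loss
```

In a framework like Keras this corresponds to passing a `kernel_regularizer` to a `Dense` layer: the regularizer touches the kernel (weights), and the activation is applied afterwards exactly as before.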
Each ReLU unit contributes a little line segment, so the network's output is piecewise linear. With too many ReLU units the model has enough "bends" to overfit the training data, and regularization is one way to fix that.
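To see the piecewise-linear picture concretely, here is a small sketch (the particular weights and biases are made up for illustration): each ReLU term is zero on one side of its "hinge" and linear on the other, and summing a few of them gives a curve that bends at each hinge point.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

# Two ReLU units with hand-picked (illustrative) weights and biases.
# Each one is flat on one side of its hinge and linear on the other;
# their sum is a piecewise-linear function with a bend at each hinge.
x = np.linspace(-2.0, 2.0, 9)
y = relu(1.0 * x - 0.5) + relu(-2.0 * x + 1.0)
```

With many units, the network can place many such bends, which is exactly the flexibility that lets it overfit; penalizing the weights keeps the segments from tilting sharply to chase noise.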