Regularization of output layer in neural network

bhavanamalla · July 26, 2023, 11:19am

Hi,

In the week 3 assignment of C3, in the complex neural network model with regularization, the kernel regularizer is applied only to the first two layers and not to the output layer. But the output layer also has w and b parameters, and its activation is linear. Is there a reason for this? Do we need to skip regularization for the output layer?

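import tensorflow as tf
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Dense

# `classes` (the number of output classes) is defined elsewhere in the assignment notebook
tf.random.set_seed(1234)
model_r = Sequential(
    [
        Dense(120, activation='relu', kernel_regularizer=tf.keras.regularizers.l2(0.1), name="L1"),
        Dense(40, activation='relu', kernel_regularizer=tf.keras.regularizers.l2(0.1), name="L2"),
        Dense(classes, activation='linear', name="L3")  # output layer: no kernel_regularizer
    ], name="ComplexRegularized"
)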

pastorsoto · July 26, 2023, 11:45am

Hi @bhavanamalla great question!

Regularization is typically applied to the hidden layers, and we usually skip it for the output layer. However, since machine learning is an iterative process, you could experiment with adding regularization to the output layer and see if that works better for your dataset.

I hope this helps!
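A minimal sketch of that experiment, assuming the same architecture as the model in the question. The helper name `build_model`, the input width of 2, and the `output_l2` value of 0.01 are illustrative assumptions, not taken from the assignment:

```python
import tensorflow as tf
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Dense, Input


def build_model(classes, output_l2=None, n_features=2):
    """Build the three-layer model; optionally add an L2 penalty to the output layer.

    output_l2 and n_features are hypothetical parameters used for illustration.
    """
    # No regularizer on the output layer unless a strength is passed in.
    out_reg = tf.keras.regularizers.l2(output_l2) if output_l2 else None
    return Sequential(
        [
            Input(shape=(n_features,)),
            Dense(120, activation="relu",
                  kernel_regularizer=tf.keras.regularizers.l2(0.1), name="L1"),
            Dense(40, activation="relu",
                  kernel_regularizer=tf.keras.regularizers.l2(0.1), name="L2"),
            Dense(classes, activation="linear",
                  kernel_regularizer=out_reg, name="L3"),
        ],
        name="ComplexRegularized",
    )


baseline = build_model(classes=6)                     # output layer unregularized
with_out_reg = build_model(classes=6, output_l2=0.01)  # output layer also penalized

# Each regularized kernel contributes one penalty term to model.losses.
print(len(baseline.losses), len(with_out_reg.losses))
```

Training both variants on the same data and comparing their cross-validation error is one way to check whether the extra penalty helps on a given dataset.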

bhavanamalla · July 26, 2023, 2:57pm

Hi @pastorsoto ,

Thanks for your reply.

When I add regularization to the output layer in this assignment (the complex model with regularization), performance degrades significantly compared with the model that leaves the last layer unregularized. I am wondering why that is the case.

pastorsoto · July 26, 2023, 3:03pm

Regularization is a way to penalize the model, but if you penalize it too much, the model will underfit and perform worse, since it is no longer able to learn the data properly.

I hope this helps
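The effect can be seen in a toy setting. The sketch below is not the assignment's model: it assumes a single weight w with no bias, fitted by L2-regularized least squares (ridge regression), on made-up data where the true relationship is y = 2x. As the penalty strength grows, the fitted weight shrinks toward zero and the training error rises, which is underfitting:

```python
def fit_ridge_1d(xs, ys, lam):
    """Closed-form minimizer of sum((w*x - y)^2) + lam * w^2 over a single weight w."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


def mse(xs, ys, w):
    """Mean squared error of the prediction w*x on the given data."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Made-up data for illustration: y = 2x exactly.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0 * x for x in xs]

for lam in (0.0, 1.0, 10.0, 100.0):
    w = fit_ridge_1d(xs, ys, lam)
    print(f"lambda={lam:6.1f}  w={w:.3f}  train MSE={mse(xs, ys, w):.3f}")
```

Adding a penalty to the output layer on top of the two already-regularized hidden layers increases the total penalty in a similar way, so a smaller regularization strength may be worth trying there.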
