Hi. I have a problem with Week 1, Homework 3. As mentioned in the homework, D1 and D2 should have the same dimensions as A1 and A2 respectively. I don’t know what their sizes are, and I wrote D1 = np.random.rand(A1.shape) but I get an error. @paulinpaloalto, can you please help me with this part?
Also, how can we see the correct answers for the homework?
Thank you so much.
Sima
For this portion,
D1 = np.random.rand(A1.shape) but I get an error.
Unlike other random number generators, np.random.rand and np.random.randn were ported from MATLAB (another well-known programming environment) for convenience, and they do not accept a tuple like (a, b).
A1.shape returns a tuple like (2, 5). To pass this shape to np.random.rand, there are two ways:
- break the tuple down into separate parameters, like A1.shape[0] and A1.shape[1]
- put * in front of the tuple, like *A1.shape
Other random number generators, like np.random.random_sample, accept a tuple.
So it should be OK with np.random.random_sample(A1.shape).
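For example, all three approaches give an array with the same shape as A1 (the A1 below is just a made-up array for illustration):

```python
import numpy as np

# A1 is just an illustrative array; in the assignment it would be the
# activation matrix from the previous layer.
A1 = np.random.randn(2, 5)

# np.random.rand expects the dimensions as separate arguments, not a tuple
D1 = np.random.rand(A1.shape[0], A1.shape[1])   # index into the shape tuple
D2 = np.random.rand(*A1.shape)                   # or unpack the tuple with *

# np.random.random_sample accepts the shape tuple directly
D3 = np.random.random_sample(A1.shape)

print(D1.shape, D2.shape, D3.shape)              # (2, 5) (2, 5) (2, 5)
```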
It was a great explanation and it solved the issue. I used *A1.shape. What happens when using * here? I know that np.random.random_sample() takes a tuple as the input, but np.random.rand() takes the dimensions individually. Does using * get rid of the tuple?
An asterisk is used to “unpack” a list, tuple, and so on. But we need to be careful when using it, since there are some restrictions.
An example (a small sketch with made-up names, just for illustration):
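```python
def area(height, width):
    return height * width          # here * means multiplication

shape = (2, 5)                     # a tuple, e.g. A1.shape

print(area(*shape))                # * unpacks the tuple into two arguments -> 10
print(area(shape[0], shape[1]))    # same result, without unpacking

# The starred form is only valid in certain places, like function calls or
# list/tuple literals; writing  x = *shape  on its own line is a SyntaxError.
```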
Because “multiplication” uses the same character, this unpacking form is only allowed in certain places, so it is not very flexible.
In this sense, breaking a tuple down into multiple parameters is more straightforward.
Very nice. I got it.
Thank you so much for the quick and helpful response.
Regarding regularization in deep learning, I have seen that TensorFlow layers have a kernel_regularizer parameter. I was wondering how using this parameter in a layer affects the loss function, in the sense of the regularized cost function that Andrew Ng was describing.
If any regularizer is specified in a layer instance like Dense(), a penalty is calculated based on the specified regularization type. These penalty values are added to the loss function that you specify separately. This is pretty much aligned with what Andrew described.
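For instance, a minimal sketch in Keras (the layer sizes, input shape, and L2 factor here are just assumptions for illustration):

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(10,)),
    # L2 penalty on this layer's weight matrix
    tf.keras.layers.Dense(16, activation="relu",
                          kernel_regularizer=tf.keras.regularizers.l2(0.01)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# During training, Keras adds the L2 penalty to the binary cross-entropy
# loss automatically; nothing extra has to be written here.
model.compile(optimizer="adam", loss="binary_crossentropy")
```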
By the way, there are three types of regularization for a layer. Those are;
- kernel_regularizer
- bias_regularizer
- activity_regularizer
“kernel_regularizer”, i.e. weight regularization, penalizes the layer’s weights, which is the regularization Andrew described. “bias_regularizer” penalizes the biases, which Andrew also introduced but said he usually omits. “activity_regularizer” penalizes the layer’s output.
I think this is well-designed. In some cases, we may want to write our own loss function. To add the regularization term, we can just read layer.losses (or model.losses) to get those penalties. We do not need to re-calculate them in our custom loss function.
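As a rough sketch of that idea (the model, dummy data, and loss below are made up purely for illustration), a custom training step only needs to sum model.losses into its own data loss:

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(10,)),
    tf.keras.layers.Dense(16, activation="relu",
                          kernel_regularizer=tf.keras.regularizers.l2(0.01)),
    tf.keras.layers.Dense(1),
])

x = tf.random.normal((8, 10))    # dummy batch of inputs
y = tf.random.normal((8, 1))     # dummy targets

with tf.GradientTape() as tape:
    y_pred = model(x, training=True)
    data_loss = tf.reduce_mean(tf.square(y - y_pred))    # our own loss term
    # model.losses already holds the penalties computed by the regularizers,
    # so we just add them in instead of re-computing them ourselves.
    total_loss = data_loss + tf.add_n(model.losses)

grads = tape.gradient(total_loss, model.trainable_variables)
```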