Question about week 3 assignment

Hi! I have a question about the forward_propagation function in the Week 3 assignment. Why do we return Z4 without applying any activation function at that layer?

I don't recall whether Prof Ng discusses this anywhere in the lectures, but the TF/Keras loss functions all let you choose whether their inputs are "logits" (the linear activation output) or actual "post-activation" values. The argument that controls this is from_logits; it takes a Boolean value and defaults to False. Have a look at the documentation for tf.keras.losses.CategoricalCrossentropy.

The reason they offer the from_logits = True mode is that it is more efficient and more numerically stable to compute the activation and the loss at the same time. For example, it becomes easier to handle the "saturation" case, in which some of the outputs come out as exactly 0 or exactly 1. That never happens from a pure-math point of view, but we are dealing with finite floating point representations here, so it can actually happen. If you don't handle that case, the loss is undefined (NaN or Inf), because log(0) does not exist.
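Here is a small sketch of the saturation problem in plain Python (my own illustration, not code from the assignment): the sigmoid of a large logit rounds to exactly 1.0 in floating point, so the naive post-activation loss blows up, while the same loss computed directly from the logit stays finite.

```python
import math

z = 40.0                         # a large positive logit
p = 1.0 / (1.0 + math.exp(-z))   # sigmoid saturates: p rounds to exactly 1.0
print(p == 1.0)                  # True in float64

# Naive binary cross entropy for a negative label (y = 0): -log(1 - p)
try:
    naive_loss = -math.log(1.0 - p)   # log(0) -> math domain error
except ValueError:
    naive_loss = float("inf")
print(naive_loss)                # inf: the loss is undefined

# Same loss computed directly from the logit:
# -log(1 - sigmoid(z)) = z + log(1 + exp(-z)), which never touches log(0)
stable_loss = z + math.log1p(math.exp(-z))
print(stable_loss)               # ~40.0, perfectly finite
```

The rewritten form is exact algebra, not an approximation, which is why doing the activation inside the loss costs nothing in accuracy.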

So from this point forward, Prof Ng always uses from_logits = True mode. The activation function is still applied; it just happens "inside" the loss function. The same option exists for binary cross entropy loss and for the sparse version of categorical cross entropy.
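To make the "activation inside the loss" idea concrete, here is a minimal pure-Python sketch of what the from-logits mode of categorical cross entropy effectively computes (the helper name stable_categorical_ce is mine, not TF's): -log(softmax(z)[c]) simplifies to logsumexp(z) - z[c], and the log-sum-exp can be evaluated without ever exponentiating a huge number.

```python
import math

def stable_categorical_ce(logits, label):
    """Cross entropy computed directly from logits (hypothetical helper).

    -log(softmax(z)[label]) = logsumexp(z) - z[label]
    Shifting by the max keeps every exp() argument <= 0, so nothing overflows.
    """
    m = max(logits)
    logsumexp = m + math.log(sum(math.exp(z - m) for z in logits))
    return logsumexp - logits[label]

# A huge logit would overflow a naive softmax (math.exp(1000.0) raises
# OverflowError), but the from-logits computation handles it fine:
loss = stable_categorical_ce([1000.0, 0.0, 0.0], label=0)
print(loss)   # 0.0: the model is (over)confident in the correct class
```

This is only a sketch of the underlying math; in the assignment you just pass from_logits = True and let TF do the fused computation for you.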