C2 W3 Tensorflow assignment

SAURAV_DAGAR · January 19, 2022, 2:10pm

Hi,
Why we didn’t use sigmoid function while training model in the assignment?
However we have just calculated Z3 and passed it to the function that calculates cost.
Please help

paulinpaloalto · January 19, 2022, 6:00pm

We did, but the way it happens is by using the from_logits = True parameter to the loss function to tell it to do both the activation and loss computations together. That is both more efficient and more numerically stable. For example, it’s easier to handle things like saturated sigmoid values which would normally cause NaN results.

Well, note that this is a multiclass case, so it’s not sigmoid, but softmax as the activation. That’s why the loss function is “categorical” cross entropy instead of “binary” cross entropy. But the same mechanism applies in both cases.

They give you a link to the documentation for the cost function in the instructions. It might be worth actually reading it with the above description in mind. What you will find from this point forward is that Prof Ng always uses this method.

KANDOORI_HARINI · October 26, 2022, 5:09pm

well, note that this is a multiclass case, so it’s not sigmoid, but softmax as the activation. That’s why the loss function is “categorical” cross entropy instead of “binary” cross entropy. But the same mechanism applies in both cases.

paulinpaloalto · October 26, 2022, 6:05pm

Yes, that is a quote from my previous reply. Do you have an additional point or question?

Topic		Replies	Views
(Tensorflow assignment )Why not compute A3 in forward_prop function? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	4	745	May 11, 2021
TensorFlow use of Z3 instead of A3 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	624	May 10, 2022
C2_W3_multiclassification Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	516	September 5, 2022
Course2, Week3, cannot compute_cost Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	537	August 25, 2022
Where is the activation function in Week 2 - Transfer Learning assignment Convolutional Neural Networks coursera-platform	6	520	July 10, 2022

C2 W3 Tensorflow assignment

Related topics