Hello @Nhat_Minh, welcome to our community!
The motivation behind the improvement is that TensorFlow works more accurately with the 3rd equation on the left than with the 2nd equation on the left. We can see that in the 3rd equation, the term $a_{out}$ never shows up, which means that by adopting the 3rd equation, TensorFlow does not need to calculate $a_{out}$. This is good because calculating $a_{out}$ can introduce numerical inaccuracy, which is not favourable.
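For readers without the slide at hand, the two forms being compared are presumably the following (this is the standard binary cross-entropy case, with $g$ the sigmoid, so treat it as my reconstruction rather than the exact slide):

$$\text{loss} = -y\log(a_{out}) - (1-y)\log(1-a_{out}), \qquad a_{out} = g(z) = \frac{1}{1+e^{-z}}$$

$$\text{loss} = -y\log\left(\frac{1}{1+e^{-z}}\right) - (1-y)\log\left(1-\frac{1}{1+e^{-z}}\right)$$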
To make TensorFlow work without calculating $a_{out}$, we need to change the activation in the output layer from `sigmoid` to `linear`, because having `sigmoid` there is the reason for TensorFlow to compute $a_{out}$. Using `linear` actually means that we do not apply any activation function at all, since the linear "activation" is just the identity. Therefore, changing from `sigmoid` to `linear` means that we switch from passing $a = g(z)$ into the loss to passing $z$ into the loss.
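As a quick illustration of this first change, here is a minimal sketch (the layer sizes are made up for the example):

```python
import tensorflow as tf
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Dense

# Before: the output layer applies the sigmoid, so the model outputs a_out
model_before = Sequential([
    Dense(25, activation='relu'),
    Dense(1, activation='sigmoid'),  # computes a_out = g(z)
])

# After: a 'linear' activation is the identity, so the model outputs z itself
model_after = Sequential([
    Dense(25, activation='relu'),
    Dense(1, activation='linear'),   # outputs the logit z
])
```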
Moreover, we need to let TensorFlow know that we are passing $z$ into the loss instead of $a$, because TensorFlow cannot detect the change by itself. We notify TensorFlow by adding `from_logits=True` to the loss. By the way, "logit" is just another name for $z$.
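Putting the two changes together, here is a minimal end-to-end sketch (the data and layer sizes are hypothetical, just to make it runnable):

```python
import numpy as np
import tensorflow as tf

# Toy data for the example only
X = np.random.rand(200, 2).astype('float32')
y = (X.sum(axis=1) > 1.0).astype('float32')

model = tf.keras.Sequential([
    tf.keras.layers.Dense(25, activation='relu'),
    tf.keras.layers.Dense(1, activation='linear'),  # change 1: output z, not a_out
])

model.compile(
    # change 2: tell the loss it receives logits (z), not probabilities (a_out)
    loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
    optimizer='adam',
)
model.fit(X, y, epochs=5, verbose=0)

# The trained model outputs z; apply the sigmoid yourself when you need probabilities
probs = tf.nn.sigmoid(model(X[:5]))
```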
Now, with these two code changes, we enjoy a numerically more accurate training process.
Cheers,
Raymond