Week 1 Assignment1 Softmax function

sahildatamaster · November 28, 2024, 11:03am

In the provided softmax function (file: rnn_utils.py):
Why is it e_x = np.exp(x - np.max(x))?
Shouldn’t it be just e_x = np.exp(x)?
Why is np.max(x) subtracted from x?

lukmanaj · November 28, 2024, 12:36pm

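Hi @sahildatamaster, the subtraction is there for numerical stability. If the entries of x are large, np.exp(x) overflows to infinity, and dividing infinity by infinity then gives nan. Subtracting np.max(x) makes the largest exponent 0, so every exponential is at most 1 and cannot overflow. It also does not change the result: subtracting a constant c from every entry multiplies both the numerator and the denominator by e^(-c), and that factor cancels when the values are normalized.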
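
To make this concrete, here is a minimal sketch comparing the two versions. Only the line e_x = np.exp(x - np.max(x)) comes from the question; the function names, the normalization line, and the test vectors are assumptions made for illustration, and the actual code in rnn_utils.py may differ (for example in which axis it sums over).

```python
import numpy as np

def softmax_naive(x):
    # Hypothetical version without the shift: np.exp overflows for large inputs.
    e_x = np.exp(x)
    return e_x / e_x.sum()

def softmax_stable(x):
    # Shift by the maximum so the largest exponent is 0 and exp() is at most 1.
    e_x = np.exp(x - np.max(x))
    # Assumed normalization for a 1-D vector; not copied from rnn_utils.py.
    return e_x / e_x.sum()

small = np.array([1.0, 2.0, 3.0])
large = np.array([1000.0, 1001.0, 1002.0])

# For moderate inputs both versions agree.
print(softmax_naive(small))
print(softmax_stable(small))

# For large inputs the naive version computes inf / inf, which is nan.
# errstate only silences the overflow warnings; it does not change the values.
with np.errstate(over="ignore", invalid="ignore"):
    naive_large = softmax_naive(large)
print(naive_large)

# The stable version still works, and matches softmax(small) because
# `large` is just `small` shifted by a constant.
print(softmax_stable(large))
```

Since large is small plus a constant, the stable version returns the same probabilities for both, which is the cancellation described above.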

sahildatamaster · November 28, 2024, 12:50pm

Thanks, Noted!
