Week 2 lesson : softmax

Pankaj_Shukla · March 2, 2024, 11:45pm

Link: https://www.coursera.org/learn/advanced-learning-algorithms/lecture/mzLuU/softmax

When lesson is started logistic regression example is given for y=1 and y=0 and then the equations are built for softmax but the problem I see is that softmax equations are built for y=1,2,3…N and not y =0,1,2,3. Isnt there a possibility that output matches none of 1,2,3,4. Even mathematically, how you can say loss is -log a2 or - log a3 because it was derived based on y=0 earlier. Can someone explain?

TMosh · March 3, 2024, 12:03am

Softmax doesn’t use y as integers.

Softmax takes ‘n’ floating point inputs (often they are scaled from 0.0 to 1.0), and re-scales them so that their sum equals exactly 1.0.

rmwkwok · March 3, 2024, 5:04am

Hello @Pankaj_Shukla,

You are right that labels should normally be zero-based, and this is the convention in tensorflow and many other packages. The lecture are just discussing a one-based approach so we are just going to have to keep in mind the difference.

Cheers,
Raymond

Topic		Replies	Views
Loss function for softmax regression when a = 0 Advanced Learning Algorithms week-2	2	392	December 1, 2023
Softmax Loss Function for single example Advanced Learning Algorithms week-2	18	578	December 30, 2022
Practice quiz: Multiclass Classification Advanced Learning Algorithms week-2	1	532	June 18, 2022
Improved Implementation of Softmax - Trouble Understanding the Logic Advanced Learning Algorithms week-2	4	33	August 18, 2024
Minor error in video - Course 1, Week 2 Neural Networks and Deep Learning	3	534	March 15, 2022

Week 2 lesson : softmax

Related topics