Hi there,

For multi classification problems using 'Neural Networks with Softmax", how do I interpret the the output from final layer?

Let’s say there are 10 classes in outcome variable and size of training set is 1000 rows. In final layer, do we predict probability (Y= k | X) for all observations in training set. If so an observation will have 10 probability values, 1 for each class. I am unable to understand how do I make sense of final layer output.

Thanks,

Uma