loss-functions-C4W3

paulinpaloalto · April 22, 2022, 2:07am

Which loss function to use in a given case is always a choice that you need to make. Whether you use a distance-based loss (e.g. MSE or MAE) or a “cross entropy” function depends on what your output represents: a continuous real number, or something like a probability distribution (the more common case in “classification” problems).
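To make the distinction concrete, here is a minimal sketch in plain Python. The function names (`mse`, `mae`, `cross_entropy`) and the sample numbers are just illustrative choices of mine, not anything from the course notebooks, and the `eps` clamp is only there to avoid taking the log of zero.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: a distance-based loss for real-valued outputs."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def mae(y_true, y_pred):
    """Mean absolute error: another distance-based loss for real-valued outputs."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def cross_entropy(y_true, y_pred, eps=1e-12):
    """Categorical cross entropy: y_true is a one-hot label and
    y_pred is a predicted probability distribution over the classes."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(y_true, y_pred))

# Continuous targets (hypothetical values): use a distance-based loss.
regression_loss = mse([2.0, 4.0], [1.0, 6.0])

# Class label (hypothetical 3-class example): use cross entropy.
classification_loss = cross_entropy([0, 1, 0], [0.2, 0.7, 0.1])
```

Note that only the probability assigned to the correct class contributes to the cross entropy here, whereas the distance-based losses penalize every coordinate of the output.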

The case of YOLO is interesting in that the model produces several different types of output, including bounding boxes and classifications (pedestrian, car, tree, stop light …). So you may need a “hybrid” approach to the cost function, in which you add a term for each aspect of the output. Maybe the bounding box needs something like MSE, whereas the classification of the contents needs something more like cross entropy. I think that is what Prof Ng was getting at in the section that you quote.
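Here is a rough sketch of what such a “hybrid” cost could look like for a single predicted object. To be clear, this is not the actual YOLO loss and I'm not claiming any implementation does it this way: the function name `hybrid_loss`, the weights `box_weight` and `class_weight`, and the sample values are all assumptions made up for illustration.

```python
import math

def hybrid_loss(box_true, box_pred, class_true, class_pred,
                box_weight=1.0, class_weight=1.0, eps=1e-12):
    """Weighted sum of a distance term and an entropy term.

    box_true / box_pred: bounding box coordinates as real numbers.
    class_true: one-hot class label; class_pred: predicted class probabilities.
    box_weight / class_weight: hypothetical knobs balancing the two terms.
    """
    # MSE over the bounding box coordinates (continuous outputs).
    box_term = sum((t - p) ** 2 for t, p in zip(box_true, box_pred)) / len(box_true)
    # Cross entropy over the class distribution (probability outputs).
    class_term = -sum(t * math.log(max(p, eps))
                      for t, p in zip(class_true, class_pred))
    return box_weight * box_term + class_weight * class_term

# Hypothetical example: one box coordinate is off by 0.1, and the
# correct class ("car", say) is predicted with probability 0.8.
loss = hybrid_loss(
    box_true=[0.5, 0.5, 0.2, 0.4],
    box_pred=[0.6, 0.5, 0.2, 0.4],
    class_true=[0, 1, 0],
    class_pred=[0.1, 0.8, 0.1],
)
```

The weights matter in practice because the two terms live on different scales, so how you balance them is itself part of the choice you are making when you design the cost.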

Here’s another recent thread that just talks in general about the differences between distance and entropy style loss functions, but not specific to YOLO.

