Week3: Raw Output from YOLO to get final predictions

Manas_Rastogi · May 6, 2023, 5:45pm

One topic the lectures do not really touch on is how the raw output from YOLO is transformed into final predictions. In our programming assignment, that is basically whatever goes on inside the yolo_head function in the yad2k.models.keras_yolo file.

The bounding box xy coordinates go through a sigmoid activation plus some offsetting, and its height and width are exponentiated and then scaled by the anchor height and width. Any insights / intuition from anyone on what exactly is going on here?
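For readers following along, the transformation described above can be sketched in NumPy. This is a minimal sketch and not the yad2k code itself: the name decode_cell and its parameters are hypothetical, and it assumes anchors are expressed in grid-cell units on a 19x19 grid.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def decode_cell(t_xy, t_wh, cell_xy, anchor_wh, grid_wh):
    """Decode one raw prediction into a box normalized to the image size.

    t_xy, t_wh : raw network outputs for center and shape
    cell_xy    : (column, row) index of the grid cell making the prediction
    anchor_wh  : anchor width/height, assumed to be in grid-cell units
    grid_wh    : number of grid cells along x and y
    """
    t_xy = np.asarray(t_xy, dtype=float)
    t_wh = np.asarray(t_wh, dtype=float)
    cell_xy = np.asarray(cell_xy, dtype=float)
    anchor_wh = np.asarray(anchor_wh, dtype=float)
    grid_wh = np.asarray(grid_wh, dtype=float)

    # Sigmoid keeps the center inside its own cell (offset in 0..1);
    # adding the cell index places it on the grid.
    box_xy = (sigmoid(t_xy) + cell_xy) / grid_wh
    # exp() makes the scale factor positive; the anchor supplies the base shape.
    box_wh = anchor_wh * np.exp(t_wh) / grid_wh
    return box_xy, box_wh

# Raw outputs of zero give a box centered in the cell with exactly the anchor shape.
xy, wh = decode_cell(t_xy=[0.0, 0.0], t_wh=[0.0, 0.0],
                     cell_xy=[3, 5], anchor_wh=[2.0, 4.0], grid_wh=[19, 19])
```

Read this way, the network does not predict a box from scratch. It predicts a small correction to the anchor: where inside the cell the center sits, and how much to stretch or shrink the anchor shape.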

Was trying to read blogs online and here’s what one of them mentions…

Would be really helpful if someone can give more intuition around anchor boxes, I have a feeling that there’s a lot more that we need to grasp here…

Another thing: I just looked at the formula for the localization loss in YOLO. If it’s computed as the sum of squared errors between the ground truth boxes and the predicted bounding boxes, then what role exactly do anchor boxes play? All I can grasp is that they are there to just hold multiple classes… I am getting very confused now with these anchor boxes!

TMosh · May 6, 2023, 6:39pm

I recommend you search the forum for the term “anchor box”. One of the community members has created many posts about exactly this topic.

ai_curious · May 6, 2023, 6:55pm

https://community.deeplearning.ai/search?context=topic&context_id=327390&q=YOLO%20%40ai_curious&skip_context=true

Even I didn’t know it was that many. I need to get a life

TMosh · May 6, 2023, 7:06pm

@ai_curious, your service here is greatly admired.

ai_curious · May 6, 2023, 7:16pm

Anchor boxes don’t play an explicit role in the localization loss. They influence the shapes of the predicted bounding boxes, and they determine which locations in the ground truth matrix have non-zero training data values. But their shapes are not explicitly part of the loss computation.
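A rough sketch of the second point, how anchors decide which ground truth locations are non-zero, might look like the following. The helper names, the three anchor shapes, the (x, y, w, h, presence) layout, and the 19x19 grid are all assumptions for illustration, not the assignment's actual preprocessing code.

```python
import numpy as np

def best_anchor(box_wh, anchors):
    """Index of the anchor whose shape has the highest IoU with box_wh,
    comparing shapes only (both boxes treated as sharing the same center)."""
    box_wh = np.asarray(box_wh, dtype=float)
    anchors = np.asarray(anchors, dtype=float)
    inter = np.minimum(box_wh[0], anchors[:, 0]) * np.minimum(box_wh[1], anchors[:, 1])
    union = box_wh[0] * box_wh[1] + anchors[:, 0] * anchors[:, 1] - inter
    return int(np.argmax(inter / union))

def build_target(true_box, anchors, grid=19):
    """Place one ground truth box (x, y, w, h normalized to 0..1) into a
    (grid, grid, num_anchors, 5) target holding x, y, w, h, presence."""
    x, y, w, h = true_box
    target = np.zeros((grid, grid, len(anchors), 5))
    # The grid cell containing the box center owns the object.
    col = min(int(x * grid), grid - 1)
    row = min(int(y * grid), grid - 1)
    # The anchor closest in shape (in grid-cell units) owns it within that cell.
    k = best_anchor((w * grid, h * grid), anchors)
    target[row, col, k] = [x, y, w, h, 1.0]
    return target

anchors = [(1.0, 1.0), (2.0, 5.0), (6.0, 3.0)]  # hypothetical shapes, grid-cell units
target = build_target((0.5, 0.5, 0.1, 0.3), anchors)
```

Here a tall, narrow box lands in the slot of the tall, narrow anchor at cell (9, 9). Every other slot stays zero, so only that slot contributes a localization error during training.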

Also a reminder that the loss in YOLO is not just localization loss: it also includes classification and object presence/absence. And localization loss has two components: object center coordinates and shapes.
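To make those components concrete, here is a minimal sketch of a loss with that structure, loosely following the sum-of-squares form in the original YOLO paper. The tensor layout (x, y, w, h, presence, then class scores), the function name, and the simplification of using the 0/1 presence flag as the confidence target are assumptions for illustration, not the loss used in the assignment.

```python
import numpy as np

def yolo_style_loss(pred, true, lambda_coord=5.0, lambda_noobj=0.5):
    """Sum-of-squares loss split into its components.

    pred, true: arrays of shape (..., 5 + num_classes), assumed layout
    [x, y, w, h, presence, class scores...]. Widths and heights must be >= 0.
    """
    obj = true[..., 4]      # 1 where a ground truth object is assigned
    noobj = 1.0 - obj       # 1 everywhere else

    # Localization, part 1: object center coordinates.
    center = lambda_coord * np.sum(
        obj[..., None] * (pred[..., 0:2] - true[..., 0:2]) ** 2)
    # Localization, part 2: shapes (square roots soften errors on large boxes).
    shape = lambda_coord * np.sum(
        obj[..., None] * (np.sqrt(pred[..., 2:4]) - np.sqrt(true[..., 2:4])) ** 2)
    # Object presence and absence, weighted differently.
    presence = np.sum(obj * (pred[..., 4] - true[..., 4]) ** 2)
    absence = lambda_noobj * np.sum(noobj * (pred[..., 4] - true[..., 4]) ** 2)
    # Classification, only where an object exists.
    classes = np.sum(obj[..., None] * (pred[..., 5:] - true[..., 5:]) ** 2)

    return {"center": center, "shape": shape, "presence": presence,
            "absence": absence, "classes": classes}

true = np.zeros((2, 2, 1, 7))
true[0, 0, 0] = [0.5, 0.5, 0.2, 0.4, 1.0, 1.0, 0.0]
pred = true.copy()
pred[0, 0, 0, 0] += 0.1   # center is off by 0.1 in x
pred[1, 1, 0, 4] = 0.2    # spurious confidence in an empty slot
parts = yolo_style_loss(pred, true)
```

Note that the anchors appear nowhere in this function. They have already done their work upstream, in decoding the predictions and in choosing which slots of the ground truth are non-zero.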

paulinpaloalto · May 7, 2023, 7:01am

Here’s one of the YOLO posts from ai_curious about Anchor Boxes that I have bookmarked. That one describes how the Anchor Boxes are derived. Then it links to this one, which describes how they are actually used by the algorithm.

Manas_Rastogi · May 7, 2023, 2:40pm

These two posts are super, thanks for pointing to them and of course thank you sir @ai_curious for writing them
