Encoding the anchor boxes

If you look at predict(), I think you can fill in the gap.

The final output from Darknet, the convolutional network used by YOLO, is this layer from the model summary:

conv2d_22 (Conv2D) (None, 19, 19, 425) 435625 ['leaky_re_lu_21[0][0]']

This is yolo_model_outputs in the following code block from predict().

yolo_model_outputs = yolo_model(image_data)
yolo_outputs = yolo_head(yolo_model_outputs, anchors, len(class_names))
out_scores, out_boxes, out_classes = yolo_eval(yolo_outputs, [image.size[1],  image.size[0]], 10, 0.3, 0.5)
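
To make the shapes concrete, here is a minimal sketch of how the 425 channels decompose. This is not code from the notebook; the variable names are mine and the data is random, it only illustrates the layout.

import numpy as np

# Hypothetical raw network output for one image: a 19x19 grid with 425 channels per cell.
feats = np.random.randn(1, 19, 19, 425).astype(np.float32)

# 425 = 5 anchor boxes x (4 box coordinates + 1 confidence + 80 class scores)
num_anchors, num_classes = 5, 80
feats = feats.reshape(1, 19, 19, num_anchors, 5 + num_classes)   # (1, 19, 19, 5, 85)

raw_xy   = feats[..., 0:2]   # t_x, t_y (still need sigmoid + grid-cell offset)
raw_wh   = feats[..., 2:4]   # t_w, t_h (still need exp, scaled by the anchor shape)
raw_conf = feats[..., 4:5]   # objectness logit
raw_cls  = feats[..., 5:]    # 80 class logits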

Then, the next step is yolo_head, which you can find in "./yad2k/models/keras_yolo.py". What you are looking for is in there.
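
It is worth reading that file directly, but roughly speaking the decoding it performs looks like the sketch below. This is a simplified NumPy version I wrote for illustration only; the real yolo_head in yad2k uses Keras backend ops, and its broadcasting details and variable names differ.

import numpy as np

def yolo_head_sketch(feats, anchors, num_classes):
    # feats: (19, 19, 425) raw output; anchors: (5, 2) anchor widths/heights in grid units.
    grid_h, grid_w = feats.shape[:2]
    num_anchors = anchors.shape[0]
    feats = feats.reshape(grid_h, grid_w, num_anchors, 5 + num_classes)

    sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

    # Offsets of each grid cell, so box centers become coordinates relative to the whole image.
    col = np.broadcast_to(np.arange(grid_w).reshape(1, grid_w, 1, 1), (grid_h, grid_w, num_anchors, 1))
    row = np.broadcast_to(np.arange(grid_h).reshape(grid_h, 1, 1, 1), (grid_h, grid_w, num_anchors, 1))

    # Center: sigmoid keeps it inside its own cell, then add the cell offset and normalize to [0, 1].
    box_xy = (sigmoid(feats[..., 0:2]) + np.concatenate([col, row], axis=-1)) / [grid_w, grid_h]
    # Size: exp of the prediction, scaled by the matching anchor box shape, normalized by grid size.
    box_wh = np.exp(feats[..., 2:4]) * anchors.reshape(1, 1, num_anchors, 2) / [grid_w, grid_h]
    box_confidence = sigmoid(feats[..., 4:5])
    # Class scores go through a softmax to become a probability distribution over the 80 classes.
    exp_cls = np.exp(feats[..., 5:] - feats[..., 5:].max(axis=-1, keepdims=True))
    box_class_probs = exp_cls / exp_cls.sum(axis=-1, keepdims=True)

    return box_xy, box_wh, box_confidence, box_class_probs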

Here is an overview of the object detection/localization steps in YOLO.
Please also see this link by ai_curious for the anchor-box-related operations.

The output from the network includes all candidate boxes. As you can see, the image is split into a 19x19 grid, and each grid cell has 5 anchor boxes. (The center of each anchor box lies inside its grid cell.) Each anchor box carries 4 (position) + 1 (confidence) + 80 (class probability distribution) = 85 values, which is why the channel dimension is 5 x 85 = 425.
yolo_head extracts this information from the network output (19x19x425) and returns a tuple of four tensors: (box_xy, box_wh, box_confidence, box_class_probs).
Then yolo_eval, which you wrote, performs score filtering and non-max suppression to get the final boxes with class information (a rough sketch of that stage follows this overview).
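
For that last stage, here is an illustrative sketch of score filtering followed by non-max suppression. It is not your notebook's yolo_eval; the function name and threshold defaults are placeholders I chose for the example.

import tensorflow as tf

def yolo_eval_sketch(boxes, box_scores, max_boxes=10, score_threshold=0.3, iou_threshold=0.5):
    # boxes: (N, 4) corner coordinates; box_scores: (N, 80) = box_confidence * box_class_probs.
    box_classes = tf.argmax(box_scores, axis=-1)            # best class for each box
    box_class_scores = tf.reduce_max(box_scores, axis=-1)   # score of that best class

    # Step 1: filtering - drop boxes whose best class score is below the threshold.
    mask = box_class_scores >= score_threshold
    boxes = tf.boolean_mask(boxes, mask)
    scores = tf.boolean_mask(box_class_scores, mask)
    classes = tf.boolean_mask(box_classes, mask)

    # Step 2: non-max suppression - remove overlapping boxes that describe the same object.
    keep = tf.image.non_max_suppression(boxes, scores, max_boxes, iou_threshold=iou_threshold)
    return tf.gather(scores, keep), tf.gather(boxes, keep), tf.gather(classes, keep)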

I think the above covers your question. Hope this helps.