Convolution Confusion (YOLO/UNets)

paulinpaloalto · May 9, 2024, 3:45pm

Not sure what you mean by this, but computing the center of a box is pure geometry, of course. So I assume you mean finding the centroid of each recognized object, which determines the grid cell the object is assigned to. The network just learns that through training. It takes a huge amount of data to train an algorithm like YOLO, and all of the data is labeled with all the info, including object types and bounding boxes. The loss function is a hybrid, since it has to handle classification outputs as well as regression-style outputs (the box coordinates). Prof Ng does not really discuss how the training works, but there are a number of very detailed threads here on the forum about YOLO. Here’s one that covers the training.
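To make the "pure geometry" part concrete, here is a minimal sketch of how a labeled box could be mapped to its grid cell when preparing training data. The function names, the corner-style box format `(x_min, y_min, x_max, y_max)` in pixels, and the 19 x 19 grid on a 608 x 608 image are assumptions chosen for illustration, not something taken from the YOLO code itself.

```python
def box_center(x_min, y_min, x_max, y_max):
    """Return the (x, y) center of a box given by its corners."""
    return (x_min + x_max) / 2.0, (y_min + y_max) / 2.0


def assign_grid_cell(box, image_w, image_h, grid_size=19):
    """Return the (row, col) of the grid cell that contains the box center.

    Assumes box = (x_min, y_min, x_max, y_max) in pixels and a square
    grid_size x grid_size grid laid over the whole image (hypothetical setup).
    """
    cx, cy = box_center(*box)
    cell_w = image_w / grid_size
    cell_h = image_h / grid_size
    # Clamp so a center lying exactly on the right or bottom image edge
    # still falls in the last cell.
    col = min(int(cx // cell_w), grid_size - 1)
    row = min(int(cy // cell_h), grid_size - 1)
    return row, col


# A box whose center is (200, 300) in a 608 x 608 image with 32-pixel cells.
row, col = assign_grid_cell((100, 200, 300, 400), image_w=608, image_h=608)
print(row, col)
```

This only covers how the labels get assigned to cells. At prediction time the network outputs the box center itself, and that part is what gets learned through training.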

They are learned a priori using a different algorithm. Here’s a thread about that. And here’s a thread about how they are applied.
