The image isn’t actually divided. The entire image is fed into the neural net once, and the net makes a large number of predictions about it in a single forward pass.
It is the training data labels, not the image, that are structured to match the grid cell, anchor box, and class-count specifications.
- X, the input, is a single image: the entire thing.
- Y, the training data labels, is a multidimensional tensor whose shape is determined by the grid cell count, anchor box count, and number of classes.
- \hat{Y}, the network output (the predicted values), has the same shape as Y.
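As a minimal sketch of what that shape looks like, here is one possible configuration; the grid size (19×19), anchor count (5), and class count (80) are assumptions chosen for the example, not values from this thread:

```python
import numpy as np

# Assumed configuration (not specified in this thread):
GRID_H, GRID_W = 19, 19   # grid cells
NUM_ANCHORS = 5           # anchor boxes per cell
NUM_CLASSES = 80          # object classes

# One slot per (cell, anchor): p_c, b_x, b_y, b_w, b_h, then class scores.
Y = np.zeros((GRID_H, GRID_W, NUM_ANCHORS, 5 + NUM_CLASSES))
print(Y.shape)  # (19, 19, 5, 85)
```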
The work of setting up the training data for YOLO, then, is mapping each object's bounding box to the appropriate location in Y.
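Here is a hedged sketch of that mapping, using the same assumed configuration as above. Note that a real pipeline picks the anchor slot by IoU between the box and the anchor shapes; that choice is simplified to slot 0 here:

```python
import numpy as np

# Assumed configuration (same assumptions as the earlier sketch):
GRID_H, GRID_W, NUM_ANCHORS, NUM_CLASSES = 19, 19, 5, 80

def encode_boxes(boxes):
    """Map ground-truth boxes into the label tensor Y.

    `boxes` is a list of (x, y, w, h, class_id), with coordinates
    normalized to [0, 1] relative to the whole image. Anchor
    assignment is simplified here (always slot 0); a real pipeline
    selects the anchor with the best IoU against each box.
    """
    Y = np.zeros((GRID_H, GRID_W, NUM_ANCHORS, 5 + NUM_CLASSES),
                 dtype=np.float32)
    for x, y, w, h, class_id in boxes:
        # The grid cell containing the box center "owns" the object.
        row = min(int(y * GRID_H), GRID_H - 1)
        col = min(int(x * GRID_W), GRID_W - 1)
        a = 0                                  # simplified anchor choice
        Y[row, col, a, 0] = 1.0                # objectness p_c
        Y[row, col, a, 1:5] = [x, y, w, h]     # box geometry
        Y[row, col, a, 5 + class_id] = 1.0     # one-hot class score
    return Y

# One box: centered at (0.5, 0.5), 20% of image size, class 3.
Y = encode_boxes([(0.5, 0.5, 0.2, 0.2, 3)])
print(Y.shape)          # (19, 19, 5, 85)
print(Y[9, 9, 0, :6])   # [1.  0.5 0.5 0.2 0.2 0. ]
```

Every position in Y that does not correspond to an object stays zero, which is exactly what lets the network learn to predict p_c = 0 for empty cells.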
Try searching the forum for something like "YOLO training data" to find other related discussions. Here is one such…