Course 4, Week3: question on the size of the bounding box

yipan9305 · September 28, 2021, 2:20pm

Hi,
I don’t understand how the algorithm can detect the size of the bounding box if part of an object is outside of the grid, in which the center of the object is located (at 11:27 minute in Course 4, week3 video: “Bounding box predictions”). I understand that when a grid is passed into a conv-net, it can detect the center of the object. However, a portion of the object is outside of current grid and was located in a neighboring grid (like the image cropped from Andrew’s slide). How the total bounding box size is determined in the algorithm?

Thank you!

ai_curious · September 28, 2021, 6:23pm

Sliding windows does pass an image region into the network, and it doesn’t do well with objects that weren’t fully within the region. In contrast, YOLO does not pass an image region into the network. It passes the entire image. Each grid cell is predicting at the same time on the same input.

yipan9305 · September 28, 2021, 7:38pm

Thanks, ai_curious. I went through several threads you and others posted. Need some time to digest them.

Topic		Replies	Views
How does a cell detect a bounding box bigger than itself, YOLO? Convolutional Neural Networks	6	825	July 10, 2021
YOLO concept confusion Convolutional Neural Networks	1	643	November 3, 2021
YOLO Algorithm and grid cells Convolutional Neural Networks week-3	11	87	March 19, 2025
YOLO - How does Bounding box get identified when Object spawns multiple sliding windows(Grids) Convolutional Neural Networks	2	731	November 25, 2021
[C4W3] YOLO grid question Convolutional Neural Networks	1	668	August 26, 2021

Course 4, Week3: question on the size of the bounding box

Related topics