How does YOLO algorithm look into grids one by one and understand,a part of the object is in other grid

Chazz · September 9, 2021, 12:15pm

what is the m in yolo algorithm for each image (m,608,608,3)

ai_curious · September 15, 2021, 8:10pm

m is for image, as in members the collection of training images.

m is not related to the subject of your thread, by which I mean it has nothing to do with grid cells, grid cell size, or object size. The first part to understanding the question about ‘looking into grids’ is that it doesn’t look into grids. It looks at an entire image all at the same time. Then a bunch of predictions are made, including predictions about where object centers are and what their shapes are. The predicted center falls in exactly one grid cell. The predicted shape can be smaller, same size, or larger than one grid cell.

There are some other posts that cover the mechanisms for doing this. You might find them using search. For example, here is one.

Topic		Replies	Views
How does a cell detect a bounding box bigger than itself, YOLO? Convolutional Neural Networks	6	825	July 10, 2021
YOLO Algorithm and grid cells Convolutional Neural Networks week-3	11	87	March 19, 2025
Grids in YOLO Algorithm Convolutional Neural Networks week-3	6	413	January 15, 2024
How does YOLO know if 3 cells make 1 object? Convolutional Neural Networks	3	611	August 14, 2023
https://www.coursera.org/learn/convolutional-neural-networks/lecture/fF3O0/yolo-algorithm Convolutional Neural Networks	5	695	March 12, 2023

How does YOLO algorithm look into grids one by one and understand,a part of the object is in other grid

Related topics