Queries regarding YOLO and Sliding window

paulinpaloalto · December 23, 2024, 3:56pm

To give a little more detail on question 2) in addition to Kader’s excellent response, note that the training of YOLO to recognize objects is not as focussed on the grid cells as you might expect. The grid cells are primarily used as a convenient way to organize the presentation of the results. There is no requirement that an object be contained completely within a grid cell, but the object is assigned to the grid cell that contains the centroid of the object. That also makes the NMS post processing more efficient, since it’s unlikely that two objects presented in the output are really the same object if their centroids are in different grid cells.

YOLO is by far the most sophisticated algorithm we have seen so far in DLS. There are a number of threads on the forum that explore various aspects of YOLO in quite a bit more detail than is covered in the lectures. For example, here’s one that talks about how grid cells and anchor boxes are used in YOLO. And here’s one that talks about the Non-Max Suppression that I referred to earlier.

Topic		Replies	Views
Questions about sliding window and YOLO Convolutional Neural Networks coursera-platform	4	764	January 12, 2022
Week 3 Yolo Doubt About Sliding Window Convolutional Neural Networks coursera-platform	7	776	August 18, 2024
Questions about YOLO Convolutional Neural Networks coursera-platform	13	2529	January 23, 2025
What is the exact difference between convolutional sliding windows and the YOLO algorithm? Convolutional Neural Networks week-module-3	1	22	July 8, 2025
Yolo centroids/conv impl Convolutional Neural Networks week-module-3 , coursera-platform	15	238	May 15, 2024

Queries regarding YOLO and Sliding window

Related topics