Object detection using YOLO

ai_curious · March 13, 2023, 5:00pm

I say that because we’re considering the case where there is only one object but two grid cells ‘claim’ it, to use your word. The object’s center lies in only one of those cells, so any other claims are mistakes.

The key to understanding how a bounding box prediction can be larger than one grid cell is this diagram from the paper…

The bounding box width and height $b_w, b_h$ are multiples of the anchor box width and height $p_w, p_h$. Here $p$ is used for the anchor box because the paper refers to anchors as priors. Each anchor dimension is multiplied by $e^{t}$, where $t$ is the direct output of the network: $b_w = p_w e^{t_w}$ and $b_h = p_h e^{t_h}$. $e^{t}$ can be any positive number. If $t > 0$ then $e^{t} > 1$, so $b$ will be larger than $p$, and it can even be larger than the grid cell size.
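
To make the scaling concrete, here is a minimal Python sketch of just the width/height part of the decoding. The function name `decode_box_size` and all the numbers are made up for illustration; sizes are expressed in units of grid cells, which is an assumption of this example and not something the network code necessarily does.

```python
import math

def decode_box_size(t_w, t_h, p_w, p_h):
    """Scale an anchor (prior) shape by e^t to get the predicted box shape.

    t_w, t_h: raw network outputs (any real number).
    p_w, p_h: anchor box width and height.
    """
    b_w = p_w * math.exp(t_w)
    b_h = p_h * math.exp(t_h)
    return b_w, b_h

# Hypothetical anchor, measured in grid cells (1.0 == one cell wide).
anchor_w, anchor_h = 1.5, 2.0

# t_w > 0 stretches the width beyond the anchor; t_h == 0 leaves the height unchanged.
b_w, b_h = decode_box_size(0.7, 0.0, anchor_w, anchor_h)
print(f"predicted box: {b_w:.2f} x {b_h:.2f} grid cells")

# A negative t shrinks the box below the anchor size.
small_w, small_h = decode_box_size(-1.0, -1.0, anchor_w, anchor_h)
print(f"shrunken box:  {small_w:.2f} x {small_h:.2f} grid cells")
```

With a positive `t_w` the predicted width comes out at roughly three grid cells, even though the prediction belongs to a single cell. Nothing in the formula ties the box size to the cell size.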

Here’s another recent thread that should resonate…
