Yolo Anchor Boxes

Neither of these is precisely correct.

  1. Anchor boxes shapes are predetermined, but they are not class-specific. Think it through; cars close up and cars far away are not the same size. Also, some models have been trained on data with thousands, even tens of thousands, of classes. Since anchor box number determines YOLO network output shape and thus number of predictions (and computations) you maybe can have 10 anchor boxes but you cannot have 10,000.

  2. The anchor box shapes are learned from the training data, but not by the YOLO network and not including location (b_x, b_y). Rather, the shapes are selected through running unsupervised learning on the ground truth shapes only…location is irrelevant. This also has been previously discussed extensively in the forum.

Hope this helps

1 Like