I really appreciate your detailed answer and also the thread that you shared.
Below block from your thread(Detecting Multiple Objects using YOLO - Grid Cells plus Anchor Boxes) helped a lot for me to understand YOLO.
So, YOLO allows us to make S * S * B classification output in a single forward propagation by adding one more dimension(of course corresponding model change), and also we can add one more dimension C for bounding box prediction if we need, and it can be also done in a same forward propagation,
therefore we can do multiple classification and bounding box prediction in a single forward propagation which is the meaning of “You Look Only Once”.