Hi there,
Recently, I have spent some time to study Yolo v1 and plan to modify it to implement a specific goal: detect car’s license plate.
But I am NOT sure if my idea will work in theory, so I present my thought as below, would you pls help review it and give me any suggestion?
Modification on net structure
The code of Yolo v1 I got is design to detect 20 classes objects. the last two layers are:
dense_2 (Dense) (None, 1470)
yolo_outputs (YoloOutputs) (None, 7, 7, 30)
Due to images I will use to train the model always includes a license plate, I plan to move class information in the output and change the last 2 layers as follows:
dense_2 (Dense) (None, 490)
yolo_outputs (YoloOutputs) (None, 7, 7, 10)
Modification on loss function
the original loss:
loss = class_loss + (0.5 * no_object_loss + object_loss) + 5 * box_loss
Because of the same reason, my version:
loss = (0.5 * no_object_loss + object_loss) + box_loss