When you complete the assignment, the model performance is accuracy 0.86 and loss 0.55
I thought I’d play around a bit with the model and see how different layers affect the performance… So I started by removing all the layers from Stage 3, 4, 5 (including the last avg pool). I basically reduced the model just to Stage 1 and 2 with one FC layer and suddenly my model performs much better (accuracy 0.933 and loss 0.18
If I recall correctly, the model in the assignment was not trained to full convergence, due to the need to limit the amount of processing power required of Coursera’s server.