Hello,
I’ve been trying to pass this assignment for weeks now, but due to a serious underfitting problem (a very slow decrease in the loss function), I could not pass it. I sent my code to one of the mentors (@gent.spah ) twice, and I double-checked with him that my code is written completely correctly. With the given parameters in the assignment, I get the following result for fine-tuning (Exercise 10):
Start fine-tuning!
batch 0 of 1000, loss=1.8442183
batch 10 of 1000, loss=1.8161771
batch 20 of 1000, loss=1.7634711
batch 30 of 1000, loss=1.7045426
batch 40 of 1000, loss=1.6465161
batch 50 of 1000, loss=1.5921935
batch 60 of 1000, loss=1.5430243
batch 70 of 1000, loss=1.5005184
batch 80 of 1000, loss=1.466491
batch 90 of 1000, loss=1.4419613
batch 100 of 1000, loss=1.4258995
…
batch 950 of 1000, loss=1.297606
batch 960 of 1000, loss=1.2968262
batch 970 of 1000, loss=1.2960507
batch 980 of 1000, loss=1.295279
batch 990 of 1000, loss=1.2945113
Done fine-tuning!
As you may see, the loss doesn’t even get close to the value mentioned in the expected output section, which is around 0.0004. I’ve been playing with the hyperparameters for weeks and found out that by increasing the learning rate to 1 for the first 10000 batches, and to 10 for the next 10000 batches, I could reduce the loss down to around 0.001, which was still not low enough for the model to pass the assignment’s criteria, and due to Colab’s time limits, I couldn’t let it run for longer hours.
Now, I have completed the whole course (all four weeks except the 2nd week) and only this assignment is left for me to finish the course. Considering my conversation with @gent.spah , and double-checking with the Eager Few-Shot Object Detection Colab, I am pretty sure my code is written correctly, but there are possibly some version-related issues, which doesn’t allow the model to learn as it is expected to do.
At this point, I don’t know anyone responsible for this assignment to contact. I would like to get some feedback to pass this assignment (I can dm my Colab), or if this issue cannot be solved by May 27th, I would like to cancel my subscription for this course. This is the 8th course that I’m completing with Deeplearning.ai, and honestly, this assignment has been the most annoying one to troubleshoot in the past year I’ve been taking courses here. Considering there are many similar posts in this forum regarding the same problem I’ve had in this specific assignment, some serious modifications should be made to this Colab.
Thanks in advance for any help.