At the end of Exercise 7, the test code requires the model weight shapes to be:
(3, 3, 256, 24)
(512,)
(256,)
I instead get:
(3, 3, 256, 24)
(256,)
(256,)
I create my box_predictor checkpoint with the _base_tower_layers_for_heads and _box_prediction_head loaded from my model. I then create the model checkpoint with this _box_predictor checkpoint and with the _feature_extractor from the model.
This is almost cut and paste from the interactive_eager_few_shot_od_training colab notebook.
Despite this I always get a value error regarding incompatible tensor shapes when running the prediction with my model. This only happens the first time, the second time I run it, it runs fine but then I get the wrong model shapes at the end as discussed.
My training loss doesn’t decrease and I’ve traced it back to this.
Any help would be really appreciated, I’ve been stuck on this assignment for about a week now