I passed this assignment using 16 filters in the first Conv2D layer, 64 filters in the second Conv2D layer, and 64 units in the first Dense layer. That setup usually gave an accuracy between 0.32 and 0.45.
The weird thing (to me) is that when I tried to boost accuracy by adding more conv filters, more dense layers, or more units in the first dense layer, accuracy would drop to around 0.04. I have experimented with adding and removing dropout layers, but they didn’t help.
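For reference, 0.04 is roughly what random guessing would score here, so the larger models may not be learning anything at all, as opposed to learning slightly worse. A quick check, assuming 24 classes that are roughly balanced (the balance is my assumption, not something I measured):

```python
# Chance-level accuracy if the model guesses uniformly at random.
# Assumption: 24 classes that appear about equally often.
num_classes = 24
chance_accuracy = 1 / num_classes
print(f"chance accuracy: {chance_accuracy:.3f}")
```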
I can’t think what would cause this drop. Does anyone have insights? To anyone getting training accuracy ~0.8 or more, how did you do it?
P.S. I also experimented with setting the softmax layer size to 24 (the number of unique labels in the training set), 25, and 26 (the length of the alphabet), but I didn’t notice any effect from that.
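To make the P.S. concrete, here is a small sketch of why the largest label value, not the count of unique labels, would set the minimum softmax size when integer (sparse) labels are used. The label list is hypothetical: it assumes letters are indexed 0 to 25 and that two indices never appear in the data, which I have not confirmed for this dataset.

```python
# Hypothetical labels: letters indexed 0..25, with two indices assumed absent.
ASSUMED_MISSING = {9, 25}  # assumption for illustration only
labels = [i for i in range(26) if i not in ASSUMED_MISSING]

num_unique = len(set(labels))  # how many distinct classes appear
min_units = max(labels) + 1    # smallest softmax size that can index every label

print(num_unique, min_units)
```

Under that assumption, a 24-unit softmax could not represent the largest label, while 25 or 26 units would both work, with the extra outputs simply never being a target.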