C5W4A1 Transformer Architecture with TensorFlow submission error

{mentor edit: hacks removed - no longer necessary}