In the Full GRU slide (Slide Page 34), I think the update gate should multiply the previous hidden state, not the candidate state, while (1 - updateGate) should multiply the candidate state.
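To make the point concrete, here is a minimal sketch of the final blending step as I read it, assuming the convention where an update gate near 1 keeps the previous state. The names `gru_blend`, `h_prev`, `h_cand`, and `z` are my own and are not taken from the slides; the gate values are passed in directly rather than computed from weights.

```python
def gru_blend(h_prev, h_cand, z):
    """Elementwise blend of previous and candidate hidden states.

    z is the update gate, each entry in [0, 1].
    z multiplies the previous state; (1 - z) multiplies the candidate.
    """
    return [zi * hp + (1.0 - zi) * hc for zi, hp, hc in zip(z, h_prev, h_cand)]


h_prev = [1.0, -2.0]  # previous hidden state (illustrative values)
h_cand = [0.0, 4.0]   # candidate hidden state (illustrative values)

print(gru_blend(h_prev, h_cand, [1.0, 1.0]))  # gate at 1: previous state is carried over
print(gru_blend(h_prev, h_cand, [0.0, 0.0]))  # gate at 0: candidate replaces it
```

Some references swap the roles of z and (1 - z), which is equivalent up to relabeling the gate, so the slide only needs to be consistent with its own definition of the update gate.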
Thanks for your report.
Slide Page 36 also needs to be corrected.