It’s amazing that the instructor is able to articulate the core ideas so well, given how abstract the original Transformer paper is (try reading it yourself before checking the course material). The course provides the necessary details that I couldn’t get myself by reading the paper. Not to mention that at the end we get to implement the paper using TensorFlow.
Thank you so much!