Week 3 Exercise 6 - compute_total_loss. Why transpose?

That is just the definition of the TF APIs: they expect “samples first” as the arrangement of the dimensions. Here’s a thread which discusses this whole question.

1 Like