No, it’s because Prof Ng used a different order of the dimensions than the TF loss function expects. Here’s a thread which discusses that point.
No, it’s because Prof Ng used a different order of the dimensions than the TF loss function expects. Here’s a thread which discusses that point.