If there are 20 mini batches, And, there are 10 time steps with each time step consisting of 5000 (one-hot encoded language) units.
I was expecting the dimension to be (20, 10, 5000) i.e, 20 batches of 10 time step (row) with 5000 columns. So, I was expecting the shape of mini batch to be (m, Tx, nx)
But exercise states it to be (nx, m, Tx) – (5000, 20, 10)
I am not able to understand – Please help