Not able to understand

Shubham_Singh3 · June 15, 2022, 6:00pm

not able to understand why * So, the shape of a mini-batch is (𝑛𝑥,𝑚,𝑇𝑥)* this shape is used in RNN.

anon57530071 · June 16, 2022, 6:44am

First of all, please clarify where are you referring ? Video ? Assignment of which week ?

Assuming that you refer W2A1 assignment, here is the way to think about the dimension.
Of course, if you design the system, you can chose your preferable definition.

In this assessment, we handle the time series data. Its length is Tx. And, data x is one dimensional input vector. The length of the vector is n_x. We want to handle multiple data at one time. The size of this mini-batch is m. So, the input data is, of course, 3-dimensional array.
Then, the design is which should be which dimension.
As this is the time-series data, we want to process data at time t at one time. In this sense, it is very natural to set “time” to the last dimension like this.

When we get a slice at tiime T_n, it will be 2D matrix, with the dimension reduction for the last axis. I think all agree this point.

Then, the next design point is which (n_x or m) should be the first dimension ? Actually, this is up-to your design. But, one design point is how we can perform all calculations efficiently.
In RNN cell, there are several matrix operation (dot product). If we see rnn_cell_forward(), a time slice of input data, x_t is used in here. It’s np.dot(W_{ax},\ x_t).

From a dot product view point, it is better for x_t to have one sample in a column, not row. (Otherwise, we need to “Transpose” it.).

So, the shape of each time slice is as follows.

In net, the input shape becomes (n_x, m, T_x).

Of course, this dimension definition is for this assignment. You can design any input shape with considering data format, slices, definitions of other variables (Shape, etc.), and performance.

Pankaj_Giri · October 29, 2024, 1:32pm

I am confuse what (nx, m) represent, is it a complete sentence of m words having nx length. if that is the case, then Tx was supposed to represent the word at T1, T2 … Tx right?

Dinesh_Bishwakarma · October 29, 2024, 3:15pm

It’s really simple. Let’s say you have 3 words. “The cat sat”. In above image m1 = “The”, m2 = “cat”, m3 = “sat”. Now each m’s have nx number of rows. In case of week 1 assignment 1. Let’s say xt_tmp = (3,3). Then, “The” has 3 rows. “cat” also has 3 rows and “sat” also has 3 rows (see the image above).

Topic		Replies	Views
Shape of mini-batches of input x Sequence Models week-module-1 , coursera-platform	6	258	May 2, 2024
Shape of the inputs to the RNN Sequences, Time Series and Prediction week-module-3	2	512	May 2, 2023
Doubt regarding the input examples in RNN Sequence Models coursera-platform	9	466	June 27, 2023
Input shape dimension for time series data Sequences, Time Series and Prediction week-module-2 , week-module-3	2	63	October 25, 2024
Week 1 assigniment 1 Sequence Models week-module-1 , coursera-platform	7	41	August 13, 2024

Not able to understand

Related topics