For the W1A2 Dinosaur assignment, I have a lot of questions about the shapes of parameters (e.g. Waa, Wax, Wya) and activation a used in the function sample().
The activation a (i.e., the a_prev variable) has shape (100, 1). I know the number 100 is set in the sample_test() function (as the variable n_a). But what is the variable n_a?
Parameter Wax’s shape is (100, 27), parameter Waa’s shape is (100, 100), while parameter Wya’s shape is (27, 100). The number 27 comes from the 26 letters of the alphabet plus the newline character. But where does the number 100 come from? Or, a better question: how are the parameters’ shapes constrained?
Thank you in advance
I did this drawing to help me understand all the shape sizes in an RNN cell. Just remember that when you multiply two matrices, the number of columns of the first matrix must equal the number of rows of the second. Ex: A.shape (m, n), B.shape (n, t) => np.dot(A, B) will result in a shape (m, t).
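To make that concrete, here is a small numpy sketch (not the graded assignment code) of one RNN step with n_a = 100 and a vocabulary of 27 characters; each np.dot only works because the inner dimensions match:

```python
import numpy as np

# Sketch of the shapes in one RNN cell step.
# n_a = hidden state size, n_x = n_y = vocabulary size (26 letters + newline = 27).
n_a, n_x, n_y = 100, 27, 27

Wax = np.random.randn(n_a, n_x)   # maps the input x_t (n_x, 1) into the hidden space
Waa = np.random.randn(n_a, n_a)   # maps the previous hidden state a_prev (n_a, 1)
Wya = np.random.randn(n_y, n_a)   # maps the hidden state a_t back to vocabulary scores
ba  = np.zeros((n_a, 1))
by  = np.zeros((n_y, 1))

x_t    = np.zeros((n_x, 1))       # one-hot input character
a_prev = np.zeros((n_a, 1))       # previous hidden state

# (n_a, n_x) @ (n_x, 1) -> (n_a, 1)   and   (n_a, n_a) @ (n_a, 1) -> (n_a, 1)
a_t = np.tanh(np.dot(Wax, x_t) + np.dot(Waa, a_prev) + ba)
# (n_y, n_a) @ (n_a, 1) -> (n_y, 1)
z_t = np.dot(Wya, a_t) + by

print(a_t.shape, z_t.shape)       # (100, 1) (27, 1)
```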
Regarding the value n_a: it is a hyperparameter. ‘a’ is the hidden state, so its size in a Recurrent Neural Network (RNN) is an important hyperparameter that needs to be chosen carefully. This size, often referred to as the number of hidden units or the hidden dimension, depends on factors such as the complexity of the task and the size of the input data, among others.
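For example, a hypothetical initializer (not the assignment’s own helper) shows that the 100 is just the chosen value of n_a, and every parameter shape follows from that choice plus the vocabulary size:

```python
import numpy as np

# Sketch: parameter shapes follow directly from whatever n_a you choose.
def init_rnn_parameters(n_a, vocab_size, seed=0):
    rng = np.random.default_rng(seed)
    return {
        "Wax": rng.standard_normal((n_a, vocab_size)) * 0.01,
        "Waa": rng.standard_normal((n_a, n_a)) * 0.01,
        "Wya": rng.standard_normal((vocab_size, n_a)) * 0.01,
        "ba":  np.zeros((n_a, 1)),
        "by":  np.zeros((vocab_size, 1)),
    }

# n_a = 100 reproduces the shapes in the assignment; n_a = 64 would work just as well.
params = init_rnn_parameters(n_a=100, vocab_size=27)
print({k: v.shape for k, v in params.items()})
```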
I hope that answers your question.
Weberson.