I just watched the RNN video where Andrew talks about the parameters W_a and b_a, and he uses them on every x. My question is: are these parameters the same for every word, or are they different?

There are 2 inputs to an RNN layer. They are:

- a^{<t-1>}, the activation that comes from the previous timestep
- x^{<t>}, the data from the current timestep

W_a = [W_{aa}; W_{ax}], i.e. the two matrices concatenated horizontally.

b_a is the bias vector.

Using the parameters above, we compute a^{<t>} = g(W_{aa} \cdot a^{<t-1>} + W_{ax} \cdot x^{<t>} + b_a), where g is the activation function (typically tanh).
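As a minimal NumPy sketch of that one step (the sizes here are hypothetical, not from the course), you can check that applying the concatenated W_a to the stacked inputs gives the same pre-activation as using W_{aa} and W_{ax} separately:

```python
import numpy as np

# Hypothetical sizes: n_a hidden units, n_x input features.
n_a, n_x = 4, 3
rng = np.random.default_rng(0)

W_aa = rng.standard_normal((n_a, n_a))   # recurrent weights, multiply a^{<t-1>}
W_ax = rng.standard_normal((n_a, n_x))   # input weights, multiply x^{<t>}
b_a = np.zeros((n_a, 1))                 # bias vector

# Horizontal concatenation: W_a = [W_aa; W_ax], shape (n_a, n_a + n_x)
W_a = np.concatenate([W_aa, W_ax], axis=1)

a_prev = rng.standard_normal((n_a, 1))   # a^{<t-1>}
x_t = rng.standard_normal((n_x, 1))     # x^{<t>}

# The two forms compute the same pre-activation:
z1 = W_aa @ a_prev + W_ax @ x_t + b_a
z2 = W_a @ np.concatenate([a_prev, x_t], axis=0) + b_a
assert np.allclose(z1, z2)

a_t = np.tanh(z1)                        # a^{<t>} = g(...)
```

The concatenated form is just a notational convenience: stacking the inputs vertically and the weights horizontally collapses the two matrix products into one.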

So W_{aa} and W_{ax} are different for every new input, right?

No, the matrices are shared across all timesteps. You’ll see more about this in the lectures and programming exercises on backpropagation for RNNs and LSTMs.

It is precisely because the internal parameters are shared across all timesteps that the term BPTT (backpropagation through time) exists: the gradients from every timestep are accumulated into the same shared W_{aa}, W_{ax}, and b_a.
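To make the sharing concrete, here is a sketch of a forward pass over several timesteps (sizes and inputs are made up): note that the loop reuses the same W_aa, W_ax, and b_a at every step, while a^{<t>} and x^{<t>} change.

```python
import numpy as np

# Hypothetical sizes: n_a hidden units, n_x input features, T timesteps.
n_a, n_x, T = 4, 3, 5
rng = np.random.default_rng(1)

# ONE parameter set, created once, outside the time loop.
W_aa = rng.standard_normal((n_a, n_a))
W_ax = rng.standard_normal((n_a, n_x))
b_a = np.zeros((n_a, 1))

a = np.zeros((n_a, 1))                          # a^{<0>}
xs = [rng.standard_normal((n_x, 1)) for _ in range(T)]

for x_t in xs:
    # Same W_aa, W_ax, b_a at every timestep; only a and x_t change.
    a = np.tanh(W_aa @ a + W_ax @ x_t + b_a)

print(a.shape)  # (4, 1)
```

If the parameters were different at each timestep, the network could not handle sequences of arbitrary length; sharing them is what lets one RNN cell be "unrolled" over any number of words.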