# RNN Architecture

Hello Team,

I am very happy to start the 5th course of this deep learning specialization.
My question is about the RNN architecture where Tx = Ty.

We pass a<1> to the second time step of the RNN. I believe that a<1> is not equal to yhat<1>. Correct me if I am wrong.
Why are we passing a<1> instead of yhat<1> to the second time step of the RNN?

Regards,
Ajay

Hi ajaykumar3456,

a<1> is the (trainable or trained) activation value of the RNN that determines the mapping from input x<1> to output y<1>. In order to determine the relation between x<2> and yhat<2>, the information captured by a<1> is passed to the second time step, where it is updated to a<2> and determines the value of yhat<2> on the basis of x<2>. This allows information from the previous step to help produce the output at the step that follows.

So a refers to the activation values in the RNN, whereas x is the input and yhat the output. The output depends on the activation values through a parameterized function that computes the output value from the activation values. See the video at 10:00. So a is different from yhat, and a is needed to determine the mapping from x to y by passing information from one time step to the next.
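A minimal sketch may make the distinction concrete. Below is one RNN step written in NumPy, with hypothetical layer sizes chosen only for illustration: a<t> is the hidden activation passed to the next time step, while yhat<t> is a separate value computed from a<t>. They even have different shapes here, so they cannot be the same quantity.

```python
import numpy as np

# Sketch of a single RNN time step (hypothetical sizes, not the course's exact code).
np.random.seed(0)

n_x, n_a, n_y = 3, 5, 2          # input, hidden, and output sizes (assumed)

Wax = np.random.randn(n_a, n_x)  # input  -> hidden
Waa = np.random.randn(n_a, n_a)  # hidden -> hidden (carried between time steps)
Wya = np.random.randn(n_y, n_a)  # hidden -> output
ba = np.zeros((n_a, 1))
by = np.zeros((n_y, 1))

def softmax(z):
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

def rnn_step(a_prev, x_t):
    # a<t> summarizes the past; it is what gets passed to the next step.
    a_t = np.tanh(Waa @ a_prev + Wax @ x_t + ba)
    # yhat<t> is computed FROM a<t> but is a different value (and shape).
    yhat_t = softmax(Wya @ a_t + by)
    return a_t, yhat_t

a0 = np.zeros((n_a, 1))
x1 = np.random.randn(n_x, 1)
x2 = np.random.randn(n_x, 1)

a1, yhat1 = rnn_step(a0, x1)   # time step 1
a2, yhat2 = rnn_step(a1, x2)   # time step 2 receives a1, not yhat1

print(a1.shape, yhat1.shape)   # (5, 1) vs (2, 1): clearly distinct objects
```

Passing a1 (rather than yhat1) forward keeps the full hidden summary of the past available to step 2; yhat1 is just a low-dimensional readout of that summary for producing the step-1 prediction.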

I hope this clarifies things.