RNN parameters understanding and vanishing gradient

wjiang · September 13, 2022, 3:50pm

Hello, can someone help provide more understanding about the weight parameters Wy and Wa, and bias by and ba? What do they do? How do you determine these parameters? It also mentioned something about gradient vanishing. These are mentioned in Sequence models as part of Deep learning specialization. I jumped directly into the sequence models and will those parameter definition and vanishing gradient introduced in previous courses? Thanks!

rmwkwok · September 15, 2022, 1:33am

I moved your thread to the category of Deep Learning Specialization Course 5.

TMosh · September 16, 2022, 1:03am

I recommend you go back to Course 1 and 2 where these concepts are introduced.

Topic		Replies	Views
Derivation of backpropagation of RNN Sequence Models coursera-platform	2	810	June 5, 2022
W1, A1, Ex. 6, Vanishing Gradients Sequence Models coursera-platform	1	408	July 13, 2023
W2 "Neural Language Model" slide missing diagram Sequence Models coursera-platform	1	494	March 18, 2023
Week1 Assignment1 Backpro question Sequence Models coursera-platform	3	593	August 16, 2021
Backpropagation Through Time and Vanishing Gradient (RNN) Sequence Models coursera-platform	10	689	September 21, 2022

RNN parameters understanding and vanishing gradient

Related topics