D5W1 A1 Assignment Exercise 7 LSTM Backward Pass need help

Nelson_Tam · July 5, 2022, 1:14am

I am trying to implement D5W1 A1 Assignment Exercise 7 " LSTM Backward Pass" but got stuck in understanding this part in parameter derivates

Parameter Derivatives

𝑑𝑊𝑓=𝑑𝛾⟨𝑡⟩𝑓[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(11)(11)dWf=dγf⟨t⟩[aprevxt]T𝑑𝑊𝑢=𝑑𝛾⟨𝑡⟩𝑢[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(12)(12)dWu=dγu⟨t⟩[aprevxt]T𝑑𝑊𝑐=𝑑𝑝𝑐˜⟨𝑡⟩[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(13)(13)dWc=dpc~⟨t⟩[aprevxt]T𝑑𝑊𝑜=𝑑𝛾⟨𝑡⟩𝑜[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(14)

The instruction here appear to ask us to concatenate the matrix a_prev with xt. but they are of different size and once I done that the following numpy error appears.

TypeError Traceback (most recent call last)
in
19 da_next_tmp = np.random.randn(5,10)
20 dc_next_tmp = np.random.randn(5,10)
—> 21 gradients_tmp = lstm_cell_backward(da_next_tmp, dc_next_tmp, cache_tmp)
22 print(“gradients["dxt"][1][2] =”, gradients_tmp[“dxt”][1][2])
23 print(“gradients["dxt"].shape =”, gradients_tmp[“dxt”].shape)

in lstm_cell_backward(da_next, dc_next, cache)
46 print (a_prev.shape)
47 print (xt.shape)
—> 48 concat = np.concatenate(a_prev, xt)
49 print (concat.shape)
50 dWf = np.dot(dft, (np.concatenate(a_prev, xt)).T)

<array_function internals> in concatenate(*args, **kwargs)

TypeError: only integer scalar arrays can be converted to a scalar index

For debugging purpose the shape of a_prev and xt are:

(5, 10)
(3, 10)

My lab ID is pahlbtkn

Topic		Replies	Views
D5W1 A1 Assignment Exercise 7 " LSTM Backward Pass" Sequence Models coursera-platform	1	592	July 7, 2022
Trouble with lstm_cell_backward function Sequence Models week-module-1 , coursera-platform	4	122	May 20, 2024
Course 5 week 1 - lstm_cell_forward Sequence Models coursera-platform	8	979	July 19, 2021
C5W1 Exercise 8 lstm_backward Sequence Models coursera-platform	3	698	January 22, 2024
DLS Course 5: Week 1 Assignment 1 Sequence Models coursera-platform	2	521	February 16, 2023

D5W1 A1 Assignment Exercise 7 LSTM Backward Pass need help

Parameter Derivatives

Related topics