D5W1 A1 Assignment Exercise 7 " LSTM Backward Pass"

Nelson_Tam · July 7, 2022, 4:59pm

I am trying to implement D5W1 A1 Assignment Exercise 7 " LSTM Backward Pass" but got stuck in understanding this part in parameter derivates

Parameter Derivatives

𝑑𝑊𝑓=𝑑𝛾⟨𝑡⟩𝑓[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(11)(11)dWf=dγf⟨t⟩[aprevxt]T𝑑𝑊𝑢=𝑑𝛾⟨𝑡⟩𝑢[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(12)(12)dWu=dγu⟨t⟩[aprevxt]T𝑑𝑊𝑐=𝑑𝑝𝑐˜⟨𝑡⟩[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(13)(13)dWc=dpc~⟨t⟩[aprevxt]T𝑑𝑊𝑜=𝑑𝛾⟨𝑡⟩𝑜[𝑎𝑝𝑟𝑒𝑣𝑥𝑡]𝑇(14)

The instruction here appear to ask us to concatenate the matrix a_prev with xt. but they are of different size and once I done that the following numpy error appears.

TypeError Traceback (most recent call last)
in
19 da_next_tmp = np.random.randn(5,10)
20 dc_next_tmp = np.random.randn(5,10)
—> 21 gradients_tmp = lstm_cell_backward(da_next_tmp, dc_next_tmp, cache_tmp)
22 print(“gradients[“dxt”][1][2] =”, gradients_tmp[“dxt”][1][2])
23 print(“gradients[“dxt”].shape =”, gradients_tmp[“dxt”].shape)

in lstm_cell_backward(da_next, dc_next, cache)
46 print (a_prev.shape)
47 print (xt.shape)
—> 48 concat = np.concatenate(a_prev, xt)
49 print (concat.shape)
50 dWf = np.dot(dft, (np.concatenate(a_prev, xt)).T)

<array_function internals> in concatenate(*args, **kwargs)

TypeError: only integer scalar arrays can be converted to a scalar index

For debugging purpose the shape of a_prev and xt are:

(5, 10)
(3, 10)

My lab ID is pahlbtkn

balaji.ambresh · July 7, 2022, 6:11pm

The 1st parameter of np.concatenate is a sequence of arrays. So, wrap the arrays in a sequence and call it like this:
np.concatenate((arr1, arr2))

Topic		Replies	Views
D5W1 A1 Assignment Exercise 7 LSTM Backward Pass need help Sequence Models coursera-platform	0	544	July 5, 2022
Trouble with lstm_cell_backward function Sequence Models week-module-1 , coursera-platform	4	156	May 20, 2024
DLS Course 5: Week 1 Assignment 1 Sequence Models coursera-platform	2	532	February 16, 2023
Week 1: Excersie 7 - lstm_cell_backed Sequence Models week-module-1 , coursera-platform	1	33	November 11, 2024
Week 1 - RNN Step by Step lstm_cell_backward Sequence Models coursera-platform	3	801	May 24, 2021

D5W1 A1 Assignment Exercise 7 " LSTM Backward Pass"

Parameter Derivatives

Related topics