Hi all,
Can anyone please help me understand how shared layers work in Keras?
Does using shared layers mean we define the architecture of the LSTM layer only once, and that single definition is then used for all layers of the model?
On the one hand, that is efficient programming. But doesn't sharing weights also hurt the accuracy of our model's predictions? Or are the weights only shared at initialization?
Cheers and best,
George
If you look at Building_a_Recurrent_Neural_Network_Step_by_Step (i.e. C5W1A1), exercise 4, you'll notice that the same LSTM cell is unrolled over multiple timesteps. At each timestep, the internal weights like W_i and W_f are not re-initialized but shared over time.
In other words, you don't create a new LSTM_cell for each timestep; you reuse the same instance. A sketch of that pattern is shown below.
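Here is a minimal sketch of that idea in Keras (the sizes n_a, n_values, Tx and the toy outputs are made up purely for illustration, not the assignment's actual code): the LSTM layer is created once, then called at every timestep, so every call sees the same weight tensors.

```python
from tensorflow.keras.layers import Input, LSTM, Reshape
from tensorflow.keras.models import Model

# Made-up sizes, just for illustration
n_a = 64        # LSTM hidden units
n_values = 90   # features per timestep
Tx = 30         # number of timesteps

# Created ONCE -> exactly one set of weights (W_i, W_f, W_c, W_o, ...)
LSTM_cell = LSTM(n_a, return_state=True)
reshaper = Reshape((1, n_values))

X = Input(shape=(Tx, n_values))
a0 = Input(shape=(n_a,))
c0 = Input(shape=(n_a,))

a, c = a0, c0
outputs = []
for t in range(Tx):
    x_t = reshaper(X[:, t, :])                      # slice out timestep t
    a, _, c = LSTM_cell(x_t, initial_state=[a, c])  # SAME instance at every timestep
    outputs.append(a)

model = Model(inputs=[X, a0, c0], outputs=outputs)

# Despite Tx calls, the model contains the LSTM layer (and its weights) only once
print(sum(isinstance(layer, LSTM) for layer in model.layers))  # -> 1
```

If you instead wrote `LSTM(n_a, return_state=True)(x_t, ...)` inside the loop, you would create Tx independent layers with Tx independent weight sets, which is exactly what weight sharing avoids.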
The point is that this is the fundamental RNN architecture: there is one "cell" (with or without LSTM) and that same cell is used at all timesteps. If you missed that point, it might be worth watching the lectures again. Note that during training and backpropagation, the data the cell sees changes at each timestep, which means the gradients contributed by each timestep will be different, but they all update the same shared weights.
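You can see that directly with a small sketch (toy sizes and a toy loss, just to illustrate the mechanics): the cell has a single set of weight tensors, and the gradient for each of them already sums the contributions from all timesteps.

```python
import tensorflow as tf

# Toy sizes, purely for illustration
n_a, n_x, Tx, batch = 4, 3, 6, 1

cell = tf.keras.layers.LSTMCell(n_a)   # one cell -> one set of shared weights
cell.build((batch, n_x))               # create the weights up front

x = tf.random.normal((batch, Tx, n_x))
states = [tf.zeros((batch, n_a)), tf.zeros((batch, n_a))]

with tf.GradientTape() as tape:
    loss = 0.0
    for t in range(Tx):
        out, states = cell(x[:, t, :], states)  # same cell, different data each step
        loss += tf.reduce_sum(out ** 2)         # toy loss at every timestep

grads = tape.gradient(loss, cell.trainable_weights)
print([w.shape for w in cell.trainable_weights])  # kernel, recurrent_kernel, bias
print([g.shape for g in grads])                   # one gradient per shared weight tensor
```

There are only three weight tensors no matter how many timesteps you run, and each gradient aggregates the per-timestep contributions before the optimizer updates those shared weights.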