GRU unit for RNN

Meir · June 2, 2021, 7:59am

Two things are unclear to me after the video on the GRU unit:

When using GRU units, does the whole RNN consist of GRU units? If not, where do GRU units appear?
If the activation produced by GRU is the same as the memory, i.e. a=c, then the next unit might have no information about the previous word in the sentence, which makes no sense. What am I missing?

balaji.ambresh · June 2, 2022, 12:33pm

A single NN can be made of multiple types of recurrent layers, such as GRU and LSTM, so the whole network does not have to consist of GRU units. The RNN layers are added after the embedding layer, if one is present.
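$C^{<t>}$ is computed from $C^{<t-1>}$ and the candidate $\widetilde{C}^{<t>}$: the update gate decides how much of the candidate from the current timestep to take and how much of the past to remember. So even though the activation equals the memory, it still carries information from earlier timesteps, and one cannot jump to the conclusion that $C^{<t>} = C^{<t-1>}$ either.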
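
To make the second point concrete, here is a minimal sketch of one GRU step in plain Python, assuming scalar states and inputs for readability. The names `gru_step`, `run`, `PARAMS` and the weight keys are made up for illustration and are not from the course code. The equations follow the full GRU with update and relevance gates: the candidate is built from the previous memory and the current input, and the update gate blends the candidate with the previous memory.

```python
import math


def sigmoid(z):
    """Logistic function used for both gates."""
    return 1.0 / (1.0 + math.exp(-z))


def gru_step(c_prev, x, p):
    """One step of a scalar GRU (illustrative sketch, not a library API).

    c_prev: memory from the previous timestep, c^{<t-1>}
    x:      input at the current timestep, x^{<t>}
    p:      dict of hypothetical scalar weights and biases
    """
    # Relevance gate: how much of the old memory feeds the candidate.
    gamma_r = sigmoid(p["w_rc"] * c_prev + p["w_rx"] * x + p["b_r"])
    # Update gate: how much of the candidate replaces the old memory.
    gamma_u = sigmoid(p["w_uc"] * c_prev + p["w_ux"] * x + p["b_u"])
    # Candidate memory, built from the (gated) old memory and the input.
    c_tilde = math.tanh(p["w_cc"] * (gamma_r * c_prev) + p["w_cx"] * x + p["b_c"])
    # New memory: blend of the candidate and the old memory.
    c = gamma_u * c_tilde + (1.0 - gamma_u) * c_prev
    # In a GRU the activation passed to the next step equals the memory.
    a = c
    return a, c, gamma_u


def run(xs, p, c0=0.0):
    """Feed a sequence through the cell and return the activation at each step."""
    c = c0
    activations = []
    for x in xs:
        a, c, _ = gru_step(c, x, p)
        activations.append(a)
    return activations


# Arbitrary example weights, chosen only for the demonstration.
PARAMS = {
    "w_rc": 0.5, "w_rx": 0.5, "b_r": 0.0,
    "w_uc": 0.5, "w_ux": 0.5, "b_u": 0.0,
    "w_cc": 0.5, "w_cx": 0.5, "b_c": 0.0,
}

if __name__ == "__main__":
    # Same second input, different first input: the final activations differ,
    # so the second step does see information about the first "word".
    print(run([1.0, 0.5], PARAMS)[-1])
    print(run([-1.0, 0.5], PARAMS)[-1])
```

Although `a` and `c` are the same value, `c` is a function of `c_prev`, so the activation handed to the next step already contains the history. Setting the update-gate bias `b_u` to a large negative value pushes the gate towards 0 and the cell simply copies `c_prev` forward, while a large positive value makes it overwrite the memory with the candidate.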
