The literature says (copied from C5W1A1):
The “forget gate” is a tensor containing values between 0 and 1.
- If a unit in the forget gate has a value close to 0, the LSTM will forget the stored state in the corresponding unit of the previous cell state.
- If a unit in the forget gate has a value close to 1, the LSTM will mostly remember the corresponding value in the stored state.
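To make the quoted behavior concrete, here is a minimal numpy sketch of how the forget gate acts on the previous cell state (the variable names and values are my own illustration, not the assignment's code):

```python
import numpy as np

# In a real LSTM, f_t = sigmoid(Wf @ [a_prev, x_t] + bf), so every entry
# lies in (0, 1). Here f_t is hand-picked to show the extremes.
c_prev = np.array([2.0, -3.0, 0.5])   # previous cell state
f_t    = np.array([0.0,  1.0, 0.5])   # forget-gate activations

# Elementwise product: the gate scales each stored value independently.
print(f_t * c_prev)  # [ 0.   -3.    0.25]
# f_t close to 0 wipes the stored value ("forget");
# f_t close to 1 passes it through unchanged ("remember").
```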
Intuitively, the name seems off to me: the gate's values express how much to *remember*, not how much to forget. Shouldn't it be called the "remember gate" instead?
Do we follow this convention purely for legacy reasons, or am I missing something?