The forget (vs. remember) gate

Here’s what our Course 5, Programming Assignment 1 (Building a Recurrent NN Step by Step) mentions about the “forget gate”:

"* The “forget gate” is a tensor containing values between 0 and 1.

  • If a unit in the forget gate has a value close to 0, the LSTM will “forget” the stored state in the corresponding unit of the previous cell state.
  • If a unit in the forget gate has a value close to 1, the LSTM will mostly remember the corresponding value in the stored state."

Based on that definition, shouldn’t it be called the “remember” gate?
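
Just to check my understanding, the gate values stay between 0 and 1 because they come out of a sigmoid of the previous hidden state and the current input. Here is a tiny self-contained sketch of how I picture it (the variable names and shapes are mine, not necessarily the notebook's):

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

n_a, n_x, m = 5, 3, 1                       # hidden units, input size, batch size
rng = np.random.default_rng(0)
Wf = rng.standard_normal((n_a, n_a + n_x))  # forget-gate weights (made-up init)
bf = np.zeros((n_a, 1))                     # forget-gate bias
a_prev = rng.standard_normal((n_a, m))      # previous hidden state
xt = rng.standard_normal((n_x, m))          # current input

concat = np.concatenate((a_prev, xt), axis=0)
forget_gate = sigmoid(Wf @ concat + bf)     # every entry lies strictly between 0 and 1
```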

Hi @sgomezgomez,

I can see your point, but that is the name :wink:.

Personally, I am quite comfortable emphasizing its ability to forget, because forgetting is the feature the LSTM adds. Without the “forget” gate, the network always attempts to remember, so it seems to me there is no point in emphasizing how it can remember.
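
To make that concrete, here is a tiny sketch (not the assignment's code, just an illustration with made-up numbers) of what the gate does to the stored cell state:

```python
import numpy as np

c_prev = np.array([5.0, 5.0])     # previously stored cell state (two units)
forget = np.array([0.05, 0.95])   # forget-gate outputs: one near 0, one near 1
update = np.array([0.5, 0.5])     # update ("input") gate
candidate = np.array([1.0, 1.0])  # new candidate values

c_next = forget * c_prev + update * candidate
print(c_next)  # [0.75 5.25] -> the first unit forgot its 5, the second mostly kept it
```

If the forget gate were stuck at 1, the old value would always carry over, so it is the ability to drop it that the name highlights.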

Cheers,
Raymond