Hello Yifu and Ajinkya,
Here’s a quick link from this post: Understanding GRU - #3 by piyush23, which can give you a broad idea of how a GRU addresses the vanishing gradient issue within an RNN architecture.
DLS mentor Kic has posted a link in one of his replies:
The post explains how to address the vanishing gradient problem that comes with a standard recurrent neural network.
To solve the vanishing gradient problem of a standard RNN, a GRU uses two so-called gates: an update gate and a reset gate. Basically, these are two vectors that decide what information should be passed along to the output. The special thing about them is that they can be trained to keep information from long ago, without washing it out through time, or to remove information that is irrelevant to the prediction.
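To make the gating idea concrete, here is a minimal sketch of a single GRU time step in NumPy. The weight names (`W_z`, `U_z`, etc.) and the helper `gru_step` are illustrative assumptions, not from the linked post, and note that conventions differ on whether the update gate multiplies the old state or the candidate state:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x_t, h_prev, params):
    """One GRU time step; `params` holds illustrative weight matrices and biases."""
    W_z, U_z, b_z = params["z"]   # update gate parameters
    W_r, U_r, b_r = params["r"]   # reset gate parameters
    W_h, U_h, b_h = params["h"]   # candidate-state parameters

    # Update gate: decides how much of the previous hidden state to keep.
    z_t = sigmoid(W_z @ x_t + U_z @ h_prev + b_z)
    # Reset gate: decides how much of the previous state feeds the candidate.
    r_t = sigmoid(W_r @ x_t + U_r @ h_prev + b_r)
    # Candidate hidden state, built from the reset-scaled previous state.
    h_tilde = np.tanh(W_h @ x_t + U_h @ (r_t * h_prev) + b_h)
    # Interpolation: when z_t is close to 1, the old state is copied forward,
    # which is what lets the network carry information from long ago.
    h_t = z_t * h_prev + (1.0 - z_t) * h_tilde
    return h_t
```

The key point is the last line: because the update gate can learn to stay near 1, the hidden state can pass through many time steps almost unchanged, which is how the GRU keeps gradients from vanishing the way they do in a plain RNN.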