LSTM vs bidirectional RNN

I am following the course and arrived at the video describing LSTMs and bidirectional RNNs.

I am sorry if this question feels unclear or dumb.

It feels like the motivation behind these two techniques, and the issue they are trying to resolve, is the same: letting parts of the input that are “far” from the current prediction influence it, i.e. creating “links” between distant input tokens.

Do we still need an LSTM when we “fold” the input the way a bidirectional RNN does?

You can have unidirectional LSTM and bidirectional LSTM-based networks, as well as other types based on different cells! The RNN cells can be simple RNN, LSTM, or GRU.
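To make that concrete, here is a minimal sketch (assuming TensorFlow/Keras, since the thread doesn’t say which framework you’re using; the unit counts are arbitrary) showing that the cell type and the directionality are independent choices:

```python
from tensorflow.keras import layers

# Cell type and directionality are independent choices:
uni_rnn  = layers.SimpleRNN(64)                         # plain RNN, forward only
uni_lstm = layers.LSTM(64)                              # LSTM, forward only
bi_rnn   = layers.Bidirectional(layers.SimpleRNN(64))   # plain RNN, both directions
bi_lstm  = layers.Bidirectional(layers.LSTM(64))        # LSTM, both directions
bi_gru   = layers.Bidirectional(layers.GRU(64))         # GRU, both directions
```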


Right! The point of Bidirectional is that at any given timestep the cell can “see” (be influenced by or learn from) both the previous and the future timesteps. In a unidirectional LSTM (or any other type of RNN), that’s not true: it can only be influenced by what happened in the past timesteps.

The point of LSTM versus a plain vanilla RNN is that it can more easily learn from things that happened far in the past. Then when you add Bidirectional, it can learn from things far in the future as well.
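A quick way to see the “both directions” part (again just a sketch, assuming TensorFlow/Keras): the Bidirectional wrapper runs one copy of the layer forward over the sequence and another backward, and by default concatenates their outputs, which is why the output dimension doubles.

```python
import tensorflow as tf
from tensorflow.keras import layers

x = tf.random.normal((1, 10, 8))  # (batch, timesteps, features)

# Forward-only LSTM: one final hidden state per sequence
print(layers.LSTM(32)(x).shape)                         # (1, 32)

# Bidirectional: forward and backward passes, concatenated
print(layers.Bidirectional(layers.LSTM(32))(x).shape)   # (1, 64)
```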


Thanks to both of you.
My take on the problems solved by bidirectional RNNs and LSTM-based RNNs:

  • bidirectional: takes future events into account
  • LSTM: takes “far” events into account (past or future, depending on the architecture)