Experience Replay

Hi Mentor,

How experience replay avoids the problem of Oscillations ? How basically oscillations occurs ? Due to what oscillations occurs ?

My intuition is if the network biased with respect to consecutive training samples and if when the model sees new training example (variant), the gradient direction will oscillates ?