Hello Everyone,
I was trying to rewrite the reinforcement learning algorithm from scratch.
but it’s not learning anything literally. I repeatedly compared the code, and it seemed the same
that a link of the jupyter.
the issue is the total points are going up and down crazily with each episode.
Thanks