The Return in Reinforcement Learning

Asutosh_Rath · September 14, 2022, 2:14pm

I am unable to understand return in Reinforcement Learning. I am little bit confused as the return will return value when the mars rover come to the beginning position or
It is the value when the rover is going towards the reward rover will get return as discount

Wendy · September 14, 2022, 7:42pm

@Asutosh_Rath,
The return is a way for you to compare one approach to another to decide which is better. That means you’re looking at all the steps it takes for you to get from the beginning position to the terminal state.

One way to calculate a return would be to just sum up all the rewards at each state as you step through, but we want to encourage getting to the terminal state quicker, which is why we include a discount factor for each step along the way.

With this in mind, try watching the video again. I think it will click for you.

Asutosh_Rath · September 15, 2022, 12:46am

Thanks, It’s quite helpful to me.

Topic		Replies	Views
Question on discounting Unsupervised Learning, Recommenders, Reinforcement week-3	8	482	November 7, 2022
What are examples of other return formula used in real-world applications? Unsupervised Learning, Recommenders, Reinforcement week-3	1	488	August 19, 2022
Reinforcement - Terminology of "first step" Unsupervised Learning, Recommenders, Reinforcement week-3	5	322	December 8, 2023
Reinforcement Learning Intial State and reward Unsupervised Learning, Recommenders, Reinforcement week-3	10	513	March 22, 2023
Confusion regarding basic mathematics of DQN Algorithm Unsupervised Learning, Recommenders, Reinforcement week-3	11	341	February 13, 2024

The Return in Reinforcement Learning

Related topics