C3_W3 Quiz (State-action value function) Question 2

shinli256 · March 5, 2024, 8:36am

The question reads:

You are controlling a robot that has 3 actions: ← (left), → (right) and STOP. From a given state s, you have computed Q(s, ←) = -10, Q(s, →) = -20, Q(s, STOP) = 0.

What is the optimal action to take in state s?

I see what it’s asking and have no problem with the answer. However, I do think the question could specify what “optimal” means here: specifically, whether we are optimizing the reward of a single action or the total return.

rmwkwok · March 5, 2024, 8:46am

Hello Joe,

Would you think differently after reading this again? Q is defined to cover what happens “after that” too.
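
To make that concrete, here is a minimal sketch using the numbers from the quiz. It assumes Q(s, a) already means the return from taking action a in state s and behaving optimally after that, so the optimal action is simply the one with the largest Q value. The dictionary layout and the name `greedy_action` are hypothetical, for illustration only.

```python
# Q-values copied from the quiz question.
# The dictionary layout and key names are illustrative assumptions.
q_values = {"left": -10, "right": -20, "stop": 0}


def greedy_action(q):
    """Return the action with the largest Q(s, a)."""
    # Q(s, a) already includes everything that happens "after that",
    # so no extra lookahead is needed: just take the argmax.
    return max(q, key=q.get)


best_action = greedy_action(q_values)
print(best_action)  # stop
```

Since Q already folds in the future return, picking the largest Q value is optimizing the total return, not just the reward of one action.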

Cheers,
Raymond

shinli256 · March 5, 2024, 9:02am

Thanks for providing the material.

I suppose they’re consistent in using the actually useful explanation for “optimal”.
