A doubt in Deep Q Learning

JJaassoonn · July 10, 2023, 7:56am

Dear Administrator,

Could you please guide me on this issue?

“we can estimate the action-value function iteratively by using the Bellman equation”, quoted from C3_W3_A1_Assignment, Section 6 - Deep Q-Learning

May I know the reason for labelling i and i+1 differently?

Thank you

conscell · December 5, 2024, 11:56pm

The distinction between Q_i and Q_{i+1} reflects the iterative nature of the algorithm, where each update builds on the previous estimate.
Here, Q_i is the current estimate of the action-value function after the i-th iteration, and Q_{i+1} is the updated estimate obtained by applying the Bellman update rule to Q_i. Repeating the update moves the estimates toward the optimal action-value function Q^* as i grows. You can find a more detailed explanation here.
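If it helps to see the indices in action, here is a minimal sketch of the update Q_{i+1}(s,a) = R + γ max_{a'} Q_i(s',a') on a tiny tabular problem. The 3-state chain, the rewards, and the names (`next_state`, `reward`, `bellman_update`) are all made up for illustration; they are not taken from the assignment, which uses a neural network rather than a table. The point is only that the right-hand side reads the old table Q_i while the left-hand side fills a new table Q_{i+1}.

```python
import numpy as np

GAMMA = 0.5  # discount factor (illustrative value)

# Hypothetical deterministic chain: states 0, 1, 2; actions 0 = left, 1 = right.
# next_state[s, a] is the state reached, reward[s, a] the reward received.
next_state = np.array([[0, 1],
                       [0, 2],
                       [2, 2]])
reward = np.array([[0.0, 0.0],
                   [0.0, 1.0],
                   [0.0, 0.0]])
terminal = np.array([False, False, True])  # state 2 ends the episode


def bellman_update(q_i):
    """Build Q_{i+1} from Q_i using one Bellman update over all (s, a)."""
    # max over a' of Q_i(s', a'), taken from the OLD estimate; 0 after terminal states
    best_next = np.where(terminal, 0.0, q_i.max(axis=1))
    return reward + GAMMA * best_next[next_state]


q = np.zeros((3, 2))  # Q_0: initial guess
for i in range(50):
    q_next = bellman_update(q)        # Q_{i+1} computed from Q_i
    gap = np.abs(q_next - q).max()    # how much the estimate changed
    q = q_next                        # Q_{i+1} becomes the "current" estimate
    if gap < 1e-8:
        break

print(q)
```

In this toy case the table stops changing after a few iterations. In the deep version the table is replaced by a network, but the roles of Q_i (used to build the target) and Q_{i+1} (the result of the update) stay the same.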
