Where does the information to improve Q come from?

rmwkwok · October 19, 2022, 8:02am

Hello Douglas,

Let me ask some clarifying questions:

Can you share a screenshot of the slide which contains the equation? I am not sure about which equation you are referring to. You may find the slides here.

I have a feeling that you are speaking about the Bellman equation which is the sum of a series of rewards discounted by gammas.

However, in the first post of this thread, I suppose you are speaking about the Q-network. Although Q-network and the Bellman equation both speak about the Q-values, they are not equivalent. Which one should we focus on now?

You are referring the information to as the “only one component of the sum”?

current one = reward in the current state?

So is this your hypothesis? Is this what you want to talk about in this post? And how does it relate to Reinforcement learning?

For me to just look at this statement, not in the context of RL, I would agree that if the sum of a series of number is positive, the chance is higher for me to pick a positive number from the series, if the numbers are gaussian distributed.

Let me know

Cheers,
Raymond

Topic		Replies	Views
Neural network on bellman equation Unsupervised Learning, Recommenders, Reinforcement week-module-3	9	159	July 20, 2025
How does the Q-Learning Algorithm actually learn? Unsupervised Learning, Recommenders, Reinforcement week-module-3	18	616	December 5, 2023
Confusion regarding basic mathematics of DQN Algorithm Unsupervised Learning, Recommenders, Reinforcement week-module-3	11	380	February 13, 2024
Learning the Q Function Unsupervised Learning, Recommenders, Reinforcement week-module-3	16	592	July 13, 2023
Confused about reniforcement learning Unsupervised Learning, Recommenders, Reinforcement week-module-3	1	257	March 26, 2024

Where does the information to improve Q come from?

Related topics