Input to DQN in reinforcement learning

sandeep_kumar13 · May 16, 2023, 5:49am

Iam not getting the sense, without giving action as input along with state, how this model is learning the Q(s,a). Because this Q(s,a) depends on action and we are not giving any information about action.

In the previous architecture we used to give action along with state as input and model gives Q(s,a) as the output.

Mujassim_Jamal · May 16, 2023, 7:48am

I think the way this is accomplished is through a training process and the design of the neural network. The neural network uses information about the current state and the rewards it observes to learn. It does this by trying to make the predicted Q-values (which estimate the expected rewards) as close as possible to the target Q-values.

rmwkwok · May 16, 2023, 8:20am

Hi @sandeep_kumar13,

The output layer has 4 units because there are 4 and only 4 actions. Each of the 4 units represents a Q(s, a). For example, the second unit represents Q(s, a = 2) where a = 2 denotes the second action.

a is used and it is used to select the chosen one out of the four possible outcomes of Q. a is not missed out.

Cheers,
Raymond

Topic		Replies	Views
State and Action as Input vs State as Input and Q Values as Output Unsupervised Learning, Recommenders, Reinforcement week-3	2	286	March 17, 2024
Reinforcement learning: How can you be sure the NN calculates the right thing? Unsupervised Learning, Recommenders, Reinforcement week-3	1	523	August 14, 2022
Deep Reinforcement Learning Unsupervised Learning, Recommenders, Reinforcement week-3	1	498	January 2, 2023
Reinforcement Learning Unsupervised Learning, Recommenders, Reinforcement week-3	1	70	July 1, 2024
Please help me with reinforcement learning Unsupervised Learning, Recommenders, Reinforcement the-batch , ai-discussions , langchain	1	38	October 12, 2024

Input to DQN in reinforcement learning

Related topics