How to train RL model, if the reward for given state is changing dynamically?

Arjun_Reddy · February 4, 2023, 7:17pm

Let’s take an example, Reward for a state is changing dynamically in an environment, in that case how to train a RL model? Like for an example, with respective time the reward for a state is changing like for instance at one point of time the reward is +20 for state - 1 and on the same state - 1 at another point of time the reward is becoming -10. In this case how to train a model, where the reward is changing for a given state dynamical?

TMosh · February 4, 2023, 9:36pm

If the variation is based on time, then time should be one of the features for the Q network.

If the variation is based on specific states, then the number of states must be increased to cover those conditions.

If you can’t quantify what the reward is for every state, then you will have difficulty using reinforcement learning.

Topic		Replies	Views
States, actions, rewards Unsupervised Learning, Recommenders, Reinforcement week-3	4	453	August 8, 2023
How RL tackles this situation? Unsupervised Learning, Recommenders, Reinforcement week-3	3	480	February 1, 2023
Reinforcement Learning Intial State and reward Unsupervised Learning, Recommenders, Reinforcement week-3	10	513	March 22, 2023
Please help me with reinforcement learning Unsupervised Learning, Recommenders, Reinforcement the-batch , ai-discussions , langchain	1	42	October 12, 2024
Lunar lander reward Unsupervised Learning, Recommenders, Reinforcement week-3	10	328	November 12, 2023

How to train RL model, if the reward for given state is changing dynamically?

Related topics