Hi!
I am trying to solve a MinMax optimization problem using reinforcement learning but I don’t know how to design states, reward, and actions.
I would appreciate if anyone can help me.
The control variable for Max and Min is different and we have a combination of vectors and matrices in the objective function. In addition, it is a combinatorial problem.