can anyone suggest a any resources for reinforcement learning on these topics
1.multi -armed bandit
2. UCB
3.tic tac toe
4. MDp
5. gradient Bandit & non stationary problems
can anyone suggest a any resources for reinforcement learning on these topics
1.multi -armed bandit
2. UCB
3.tic tac toe
4. MDp
5. gradient Bandit & non stationary problems