Hi all,
I was playing with the lab to see how actions are getting altered with respect to reward and discount factor(I chose 0.5). How does one choose the action when there’s a tie in return? Here in the below image, it takes left. Is it because there’s a logic that checks for least number of steps if there’s a tie?
How do we typically deal with tie breakers? Do we add any rules based on physics of the problem that we are dealing with?
Thanks,
Hari