I am trying to calculate y_targets for deep Q-learning (DQL) in the Reinforcement Learning practice lab, but I am not sure how to compute it, and I am getting the following error.
Here is how I am calculating it: based on the hints in the exercise, I check whether 1 - done_vals is true. If it is, I set y_targets equal to rewards + gamma * max_qsa; otherwise, I set y_targets equal to rewards. What am I doing wrong? Do I need a for loop here?
I wonder whether the way I am assigning the value is incorrect. I am using a simple assignment statement. Do I need to use a TensorFlow-specific assignment here?
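
For reference, the rule described above (rewards + gamma * max_qsa when the episode is not done, rewards alone when it is) can be written as one elementwise expression instead of an if/else or a loop. This is only a minimal sketch, assuming rewards, max_qsa, and done_vals are 1-D float arrays of the same length and that done_vals holds 1.0 for terminal steps and 0.0 otherwise. NumPy stands in for the lab's tensors, and the function name compute_y_targets and the sample values are hypothetical, not taken from the lab.

```python
import numpy as np


def compute_y_targets(rewards, max_qsa, done_vals, gamma):
    """Elementwise targets: the bootstrap term is zeroed where done_vals == 1."""
    # (1 - done_vals) is 1.0 for non-terminal steps and 0.0 for terminal ones,
    # so multiplying by it acts as a mask; no branching or loop is needed.
    return rewards + gamma * max_qsa * (1.0 - done_vals)


# Hypothetical mini-batch of three transitions; the second one is terminal.
rewards = np.array([1.0, 2.0, 3.0], dtype=np.float32)
max_qsa = np.array([10.0, 20.0, 30.0], dtype=np.float32)
done_vals = np.array([0.0, 1.0, 0.0], dtype=np.float32)
gamma = 0.5

y_targets = compute_y_targets(rewards, max_qsa, done_vals, gamma)
print(y_targets)  # the terminal entry equals its reward alone
```

Because the mask is applied with ordinary arithmetic, the result is built by a single plain assignment. TensorFlow tensors support the same `+` and `*` operators elementwise, so the same expression should carry over there, though that is an assumption about how the lab's variables are typed.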