Calcuting Y_targets in DQL in Reinforcement learning practice lab

Anupam_Kulkarni · November 7, 2022, 1:15am

So, I am trying to calculate y_targets for DQL for the Reinforcement Learning practice lab but struggling how to calculate it and getting the following error.

How I am calculating it: Based on hints in the exercise, I am checking if 1 - done_vals is true and if yes, I am setting y_targets equal to rewards + gamma * max_qsa otherwise I am setting y_targets equal to rewards. What am I doing wrong? Do I need a For loop here?

rmwkwok · November 7, 2022, 2:24am

You are doing the exact opposite of what’s required by the exercise, please read the description carefully

Anupam_Kulkarni · November 7, 2022, 7:34am

Thanks for the response! I tried flipping the order of assignment but still get different assertion error.

In the previous one I got loss = 0.72891 and now loss = 0.154475

rmwkwok · November 7, 2022, 8:22am

Hello, please follow these steps:

insert this line right before return loss in your function.

print(rewards, gamma, max_qsa, done_vals, y_targets, q_values, loss)

On the menu bar, click “Kernel” > “restart”
run code cells one by one, and stop after running the cell for test_compute_loss(compute_loss)
given the printed raw numbers, check with the formula in the description for whether your function’s computed loss is as expected
if you can’t figure out the problem, please share the printing result and any error here again.

Raymond

Anupam_Kulkarni · November 26, 2022, 12:52am

Thank you very much for your reply! Very sorry for the delay in my response! I printed but again getting the same error,
Results below

Anupam_Kulkarni · November 26, 2022, 12:55am

I wonder if how I am assigning value is in correct. I am using a simple assignment statement. Do I need to use any tensor flow specific assignment here?

rmwkwok · November 26, 2022, 1:06am

Hello @Anupam_Kulkarni,

I suggest you to verify your work with

if ... :
    .....
    print('done = ', done, 'first set of statements ran')
else:
    ....
    print('done = ', done, 'second set of statements ran')

Raymond

PS: We can’t share assignment code here so I removed it.

Topic		Replies	Views
C3_W3_Assignment1 Unsupervised Learning, Recommenders, Reinforcement week-3	3	545	December 4, 2022
Test_compute_loss fails in my assignement Unsupervised Learning, Recommenders, Reinforcement week-3	4	493	March 10, 2023
Deep Q-Learning Algorithm with Experience Replay Unsupervised Learning, Recommenders, Reinforcement week-3	1	512	November 6, 2022
C1_W3_Logistic_Regression UNQ_C2 : What's wrong with my implementation? Supervised ML: Regression and Classification week-3	9	232	May 6, 2024
# UNQ_C2 # GRADED FUNCTION: compute_gradient Supervised ML: Regression and Classification week-2	7	503	January 20, 2023

Calcuting Y_targets in DQL in Reinforcement learning practice lab

Related topics