Don't understand why we use q_netword & target_q_network

youssef_bayoumi · September 18, 2023, 12:04pm

We are using target_q_network to make predictions or get target_y and then we are training the q_network on this new data ,after training the q_network we are updating the target_q_network using Soft Update (according to my understanding )
couldn’t we just done this using 1 nn or am I missing the whole point

rmwkwok · September 19, 2023, 2:29am

Hello @youssef_bayoumi,

You have given a brief description of how the training involves the target Q network, but not yet your understanding on its advantages. Obviously, the existence of the target Q network enables us to do the soft update, and with that in mind, please

review the lecture " Algorithm refinement: Mini-batch and soft updates (optional)" starting from ~7:50 to the end, and
review the week’s assignment for its section 6.1

During your review, write down a list of advantages and that by itself will be some answers to the question of why we are having 2 NNs that are connected via the soft-update.

If you have any follow-up, please share the list of advantages so we can discuss based on your latest understanding.

Cheers,
Raymond

Topic		Replies	Views
Why q_network is used instead of target_q_network for inference in C3_W3_A1_Assignment? Unsupervised Learning, Recommenders, Reinforcement week-3	1	520	September 7, 2022
Don't fully understand q_network and target_q_network Unsupervised Learning, Recommenders, Reinforcement week-3	4	386	August 28, 2023
Target Network Clarification Unsupervised Learning, Recommenders, Reinforcement week-3	3	752	July 10, 2023
C3W3 assignment: what is soft update? Unsupervised Learning, Recommenders, Reinforcement week-3	1	538	August 30, 2022
Confusion on Target Variable Deep Reinforcement Unsupervised Learning, Recommenders, Reinforcement week-3	28	932	September 15, 2022

Don't understand why we use q_netword & target_q_network

Related topics