Why q_network is used instead of target_q_network for inference in C3_W3_A1_Assignment?

rmwkwok · September 7, 2022, 3:34am

Hello @Akilesh_Ramalingam, the soft update to the TQN (Target Q Network) works by using the QN, so first of all, the QN needs to be trained and retained throughout the learning process. Also the purpose of the TQN is to keep QN more stable. For more relevant discussions and experiments, I strongly suggest you to spend 10-20 minutes to read through the thread from this point onwards or from the beginning of the thread.

Cheers,
Raymond

Topic		Replies	Views
Don't understand why we use q_netword & target_q_network Unsupervised Learning, Recommenders, Reinforcement week-3	1	357	September 19, 2023
Don't fully understand q_network and target_q_network Unsupervised Learning, Recommenders, Reinforcement week-3	4	386	August 28, 2023
C3W3 assignment: what is soft update? Unsupervised Learning, Recommenders, Reinforcement week-3	1	538	August 30, 2022
Target Network Clarification Unsupervised Learning, Recommenders, Reinforcement week-3	3	752	July 10, 2023
Confusion on Target Variable Deep Reinforcement Unsupervised Learning, Recommenders, Reinforcement week-3	28	932	September 15, 2022

Why q_network is used instead of target_q_network for inference in C3_W3_A1_Assignment?

Related topics