How to get y(i) value for 10000 training examples

Y_L1 · March 21, 2024, 3:55am

When Mr. Ng says there are 10,000 training examples of x,y pairs, where does the value of y(i) come from? When training, what is the ground truth y(i) and what is the predicted y(i)?

gent.spah · March 21, 2024, 6:50am

y(i) is the second value of each x, y pair. y(i) is the ground truth, and whatever the model outputs is the prediction! Keep going through the videos, and maybe watch them again!

Y_L1 · March 22, 2024, 12:56am

I meant: how do you obtain the ground truth value of y, given that Q(s’,a’) is not known? I didn’t understand this slide in the presentation. He says to randomly initialize the network and obtain a guess of the value of Q(s,a). Would this guess be the ground truth value used during training? Since the network itself produces this ground truth value, training with it would give zero loss, and no learning (updating of the parameters) takes place when the loss is zero. On the slide it says set Q=Q(new). Is Q the guess? If so, what is Q(new), and how do you get its value?
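
For reference, here is a minimal sketch of how the targets on that kind of slide are usually built, assuming the standard Bellman target y = R(s) + γ max over a’ of Q(s’,a’). The target is not the network's guess of Q(s,a). It combines the observed reward with the network's guess at the next state s’, so prediction and target generally differ and the loss is not zero. Everything named below (the linear stand-in `q_values`, the sizes, the randomly generated experience tuples, the `done` flag) is a hypothetical illustration, not the course's lab code.

```python
import numpy as np

rng = np.random.default_rng(0)
N_STATE, N_ACTIONS, GAMMA = 8, 4, 0.995  # illustrative values only
n = 10_000                               # number of training examples

# Hypothetical stand-in for the randomly initialized Q-network:
# a linear map from a state vector to one Q-value per action.
W = rng.normal(scale=0.1, size=(N_STATE, N_ACTIONS))

def q_values(states, weights):
    """Return the current guess of Q(s, a) for all actions, shape (batch, N_ACTIONS)."""
    return states @ weights

# Made-up experience tuples (s, a, R(s), s'). In practice these come from
# acting in the environment and recording what happened.
s = rng.normal(size=(n, N_STATE))
a = rng.integers(0, N_ACTIONS, size=n)
r = rng.normal(size=n)
s_next = rng.normal(size=(n, N_STATE))
done = rng.random(n) < 0.05              # assumed terminal-state flag

def build_targets(r, s_next, done, weights, gamma=GAMMA):
    """y = R(s) + gamma * max_a' Q(s', a'), with no future term at terminal states."""
    max_next_q = q_values(s_next, weights).max(axis=1)
    return r + gamma * max_next_q * (~done)

# x(i) is the pair (s, a); y(i) is the Bellman target built from the current guess.
y = build_targets(r, s_next, done, W)

# The prediction is the network's output for the action actually taken.
pred = q_values(s, W)[np.arange(n), a]
loss = np.mean((pred - y) ** 2)
print(f"mean squared error against the targets: {loss:.4f}")
```

In this sketch, Q(new) would be the network obtained by fitting `pred` to `y`. Setting Q = Q(new) then means recomputing the targets with the updated weights and repeating.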
