Since the parameters w and b weren’t trained using the test data, how come we can call the estimate overly optimistic when evaluating the cost function on the test data?
The test data shares similar patterns, distributions, and features with the training data, even though it wasn’t used for training. This similarity can cause the model to perform better on the test data than it would on entirely new, unseen data, giving an overly optimistic estimate of its generalization capability.
Hope this helps! Feel free to ask if you need further assistance.
As per the proposed solution, we need to use a cross-validation set, which is also carved out of the available data, e.g. 60% training set, 20% cross-validation set, 20% test set.
So the CV set might also contain similar patterns, distributions, and features. How is it distinguished from the test set?
The CV set is used to tune hyperparameters and adjust the model’s properties. The test set, however, is reserved as a completely unseen dataset, used only to evaluate the final model’s performance after the hyperparameters have been tuned. That final step estimates the model’s ability to generalize.
Statistically all three subsets should be very similar.
The only differences are in how they are used.
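To make the distinction concrete, here is a minimal Python sketch of the 60/20/20 split described above. The data and percentages are illustrative; the point is that the CV slice is consulted during tuning while the test slice is held back until the very end.

```python
import random

# Stand-in for 100 examples; in practice this would be your (X, y) pairs.
data = list(range(100))
random.seed(0)
random.shuffle(data)  # shuffle so all three subsets are statistically similar

n = len(data)
train = data[: int(0.6 * n)]               # 60% -- fit parameters w, b
cv    = data[int(0.6 * n): int(0.8 * n)]   # 20% -- tune hyperparameters
test  = data[int(0.8 * n):]                # 20% -- final, one-time generalization estimate

print(len(train), len(cv), len(test))  # 60 20 20
```

Statistically the three slices look alike, exactly as noted above; the safeguard is purely procedural: you may evaluate on the CV set as many times as you like while choosing the model, but you score the test set only once, after all choices are frozen.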