Hello everyone,
I have a question in regard to the lecture about establishing the baseline and I hope someone can help me with that.
The whole premise of establishing the truth baseline (for example compared with humans) is to set a truth point and then judge whether the J_train or J_CV is large or not in comparison to the baseline. In another word, giving a more meaningful understanding of the numbers that we are getting. So when we agree that human can detect voices with a 10.2% error and our model predict a 10.4% error then we can say that our model is good versus the scenario that it was a 15% error. However, the problem that I have with this is that we just shifted the problem from absolute value to relative value and have not responded to the meaning of the number.
Here the problem rises again as to whether a 0.2% error is reasonable. Let me give you an example so that it is more clear. Imagine that the ground truth is 10.2% and all other reasonable models predict a range of 10.3 ~ 10.5% error. Then is 0.2 reasonable? or the opposite scenario, what if the J error is super sensitive and with the smallest changes in the weights it jumps up to 40% percent then is a 15% error so unreasonable? What I am trying to say is now the problem is not whether the absolute number is meaningful it is the difference.
Thanks
Navid