Week 1- Bias and Variance

Marios_Constantinou · April 8, 2022, 8:38am

Given the image above from the lecture. High Bias is when the model is not performing well on the train set and high variance is when the model is performing exceptionally well on the train set.

Given:

Train set error: 15%
Dev set error: 30%

Why do we conclude that this model has a high bias AND high variance? I would think that this is still high bias because it fails on the train set and fails even more on the dev set.

Is it high bias because it fails on the train set and then high variance because it fails even more on the dev set?

Do we conclude that a model has high bias when it fails the train set and High variance when it fails the dev set? And since this model fails both, we say that it has high bias and high variance?

I think I figured it out but I will leave this thread open until someone verifies this for me

So, we check the train set, if the percentage is large (15%) then we have High bias. Then we see how much bigger is the dev set, compared to the train set. If the difference is small (15% train / 16% dev) then we conclude that we only have High Bias. If the difference is large tho (15% train / 30% dev), then we conclude that our model should go into the trash because we have High Bias (15% train) and High Variance (Performance is even worse on dev set compared to train set). But if the train set is like 1% and dev set is 15% then we say that we only have High Variance because the difference in performance is still large but not high bias because the train set error is small.

This is the general idea right?

paulinpaloalto · April 8, 2022, 8:11pm

Yes, I think your interpretation is correct. High Bias means that the error is much greater than the Bayes Error on either the training or dev sets. If you also have much better performance on the training set than the dev set (even though the training set error is still high) then you also have a High Variance problem at the same time.

But which problem you decide to attack first matters a lot depending on the situation. If the training error is high (“high bias”), then you need to address that first and you don’t have time to worry about the “high variance” problem on the dev data until you first solve the high bias problem.

Marios_Constantinou · April 9, 2022, 9:16am

Noted! Fix Bias first and then Variance. Next video addresses this as well, Basic Recipe for ML

Topic		Replies	Views
A question about high bias and high variance Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	516	October 12, 2022
Bias or variance problem Improving Deep Neural Networks: Hyperparameter tun week-module-1 , coursera-platform	1	13	January 10, 2025
A model with high variance and bias, how? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	6	555	August 6, 2022
Course 2 Week 1 Basic Recipe for ML Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	519	February 3, 2022
Clear definition of bias and variance Improving Deep Neural Networks: Hyperparameter tun coursera-platform	5	1487	June 29, 2023

Week 1- Bias and Variance

Related topics