Basic Recipe for ML - Week 1 - Train larger/More data?

Marios_Constantinou · April 8, 2022, 9:11am

I am a bit confused about Train larger for high bias and more data for high variance.

By train larger we mean that we should split the data so we have even more training examples? And by more data we mean increase our training examples all together?

So if I have 2000 training examples with a 60/20/20 split and I see a high bias problem, one of the solutions is to increase the train set? Like 70/15/15?

And if I have high variance, by saying “more data” we mean get like 2500 training examples?

gent.spah · April 8, 2022, 1:06pm

High bias is underfittng, high variance is overfitting.

When it says train larger it means a more complex model architecture so model fits the training data better.

When it says more data it means because of overfitting you need more data to present scenarios that are different from those that the training set has in it which describe a more inclusive distribution of the data.

Marios_Constantinou · April 9, 2022, 9:11am

you need more data to present scenarios that are different from those that the training set has

So increase the split ration or increase the whole dataset in general?

gent.spah · April 9, 2022, 10:30am

Either but the important this is that you need data which can better represent the distribution of information.

Marios_Constantinou · April 9, 2022, 10:41am

Thank you for the info!

Judd · April 11, 2022, 7:58am

High bias means the overall accuracy for both training and test sets is low.

The line says train longer, not train larger.

Marios_Constantinou · April 11, 2022, 8:40am

Oh dmn, you’re right. This makes more sense haha

Topic		Replies	Views
Week1 Quiz Problem Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	546	May 25, 2022
High bias, high variance Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	370	September 6, 2023
Quiz-Practical aspects of Deep Learning Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	601	August 25, 2022
How to deal with high bias and high variance? Advanced Learning Algorithms week-module-3	1	689	July 19, 2022
A model with high variance and bias, how? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	6	555	August 6, 2022

Basic Recipe for ML - Week 1 - Train larger/More data?

Related topics