Q1) At 1:21, Andrew explains that the training error will increase as the model gains more experience. The reason is that it becomes harder to fit a model as more data points are added. This makes sense. However, I have trained models of my own and observed that the training error can decrease, because the model updates its weights based on the training data and performs better on it over time. This also makes sense. So I suppose my question is: what general trend should I expect for the training error?
Q2) At 4:34, we see that the validation error decreases in the case of underfitting. But in the underfitting case, the model performs badly on both the training and the validation data. So shouldn’t the validation error also increase, giving a shape similar to the training error curve?
Regarding your questions:
Q1) The general trend to expect depends on what is on the x-axis. If you plot error against the training-set size (as Andrew does in the Learning Curves video), the training error will generally increase as more data is added, because a fixed-capacity model can no longer fit every example closely. If you instead plot error against training iterations on a fixed dataset, the training error usually decreases as the weights are updated and the model improves.
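Here is a minimal sketch (my own setup, not from the lecture) contrasting the two trends on synthetic data, using scikit-learn’s `learning_curve` for the dataset-size view and a plain `partial_fit` loop for the iteration view:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, SGDRegressor
from sklearn.model_selection import learning_curve
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(500, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.2, size=500)

# Trend 1: training error vs. number of training examples.
# With more examples a fixed model can no longer fit every point,
# so the training error typically creeps up before leveling off.
sizes, train_scores, val_scores = learning_curve(
    LinearRegression(), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5),
    scoring="neg_mean_squared_error", cv=5)
print("train MSE vs. dataset size:", (-train_scores.mean(axis=1)).round(3))

# Trend 2: training error vs. training iterations on a fixed dataset.
# Each pass updates the weights, so the error on that same data goes down.
sgd = SGDRegressor(learning_rate="constant", eta0=0.01, random_state=0)
for epoch in range(5):
    sgd.partial_fit(X, y)
    print(f"epoch {epoch}: train MSE = "
          f"{mean_squared_error(y, sgd.predict(X)):.3f}")
```

Both printouts describe the same model family; only the quantity being varied (data size vs. training time) differs.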
Q2) Andrew explains the learning curve for a model with high bias (underfitting). In this case, both the training error and the validation error are high. However, the validation error decreases initially as more data is added, before plateauing. This is because when the model is trained on very few examples, it generalizes very poorly and the validation error is large. As more data is added, the model’s generalization improves slightly, leading to a reduction in the validation error. Eventually, however, the validation error plateaus because the model is too simple to capture the underlying patterns.
Thus, while underfitting results in poor performance on both training and validation sets, adding more data can still slightly reduce the validation error early on, before it flattens out. The curves don’t mirror each other perfectly because the model’s ability to generalize may improve with more data, even if it remains underfit.
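To make the Q2 behaviour concrete, here is a small illustrative example (again my own assumed setup, not from the video): a straight-line model fit to clearly nonlinear data is underfitting, yet its validation error still drops as data is added before both curves flatten out at a high level.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import learning_curve

rng = np.random.default_rng(1)
X = rng.uniform(-3, 3, size=(600, 1))
y = X.ravel() ** 2 + rng.normal(scale=0.3, size=600)   # nonlinear target

sizes, train_scores, val_scores = learning_curve(
    LinearRegression(), X, y,
    train_sizes=np.linspace(0.05, 1.0, 8),
    scoring="neg_mean_squared_error", cv=5)

for n, tr, va in zip(sizes, -train_scores.mean(axis=1), -val_scores.mean(axis=1)):
    # Validation MSE falls at first, then both errors plateau high:
    # the signature of high bias.
    print(f"{n:4d} examples | train MSE {tr:6.2f} | val MSE {va:6.2f}")
```

The printed table shows the shape Andrew draws: training error rises toward a plateau, validation error falls toward the same high plateau, and the gap between them stays small because the model is too simple rather than overfit.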