C1W2 - Can adding data hurt? - What is meant by large model

In the video “Can adding data hurt”, Andrew states you need a large model in order to ensure adding data won’t hurt.

What is meant by a large model?

Is this referring to a neural network that has many layers? Or a network with a lot of parameters? Or something else?

Many thanks in advance


Hi @alechewitt ,

First of all, in most cases, adding more layers to a neural network will also increase the number of parameters.
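To see why depth and parameter count are related but not the same thing, here is a minimal sketch (the layer sizes are made up for illustration) that counts the parameters of fully connected layers, where a dense layer from `n_in` to `n_out` units has `n_in * n_out` weights plus `n_out` biases:

```python
# Parameters of one dense (fully connected) layer: weights + biases.
def dense_params(n_in, n_out):
    return n_in * n_out + n_out

# A "deeper" network: 100 -> 64 -> 64 -> 10
deep = sum(dense_params(a, b) for a, b in [(100, 64), (64, 64), (64, 10)])

# A "wider" but shallower network: 100 -> 256 -> 10
wide = sum(dense_params(a, b) for a, b in [(100, 256), (256, 10)])

print(deep, wide)  # 11274 28426
```

Note that the shallower-but-wider network actually has more parameters here, which is why "large model" is usually best read as "many parameters (high capacity)" rather than simply "many layers".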

A large model has the potential to learn more complex relationships from the data; in that sense, it mitigates the risk that adding data will hurt.

On the other hand, a model that is not "large enough" may lack the capacity to fit the extra data; adding data can then hurt because the model underfits, spreading its limited capacity across more examples and performing worse on the cases you care about.


In my experience, adding more data helps reduce overfitting, rather than causing it.

These snippets are from an AWS article:

“Data augmentation” increases the amount of training data.
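As a toy illustration of that idea, here is a minimal sketch (the images and labels are made up) where each training example also yields a horizontally flipped copy, doubling the dataset without collecting anything new:

```python
# Toy data augmentation: images are small grids (lists of rows).
def hflip(image):
    # Reverse each row to flip the image left-to-right.
    return [row[::-1] for row in image]

def augment(dataset):
    out = []
    for image, label in dataset:
        out.append((image, label))
        out.append((hflip(image), label))  # flipped copy keeps its label
    return out

data = [([[1, 2, 3], [4, 5, 6]], "cat")]
augmented = augment(data)
print(len(augmented))  # 2
```

Real pipelines do the same thing with random flips, crops, and rotations applied on the fly during training.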