Hi all,
I have a (maybe) silly question.
In the lecture about the two main approaches to tuning hyperparameters, the final choice of their values is made by referring to a key metric. For example, this metric could be the prediction error.
Here is my question: this metric must be evaluated on the development set, right? In fact, an earlier lecture mentioned that the dev set is used for choosing among different models (i.e. among different hyperparameter settings).
Am I right?
Thanks, ciao, Giacomo
Hi Giacomo,
You are correct. We split the whole dataset into three parts: the training, validation (development), and test sets. All model comparisons (different algorithms and different hyperparameter options) are evaluated on the development set.
Once we are satisfied with the performance of our model on the dev set, or have selected the best model we can, we finally pass the test set through it to confirm that the model actually generalizes well and didn’t overfit to the development set.
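In case it helps, here is a minimal sketch of that workflow in Python, assuming scikit-learn; logistic regression and its regularization strength C are just stand-ins for whatever model and hyperparameter you are actually tuning:

```python
# Minimal sketch of the train/dev/test workflow described above.
# Logistic regression and its regularization strength C stand in for
# whatever model/hyperparameter you are tuning.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

X, y = make_classification(n_samples=2000, random_state=0)

# Split once into train / dev (validation) / test.
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.4, random_state=0)
X_dev, X_test, y_dev, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=0)

# Hyperparameter selection: every candidate is scored on the dev set only.
best_C, best_dev_acc = None, -1.0
for C in [0.01, 0.1, 1.0, 10.0]:
    model = LogisticRegression(C=C, max_iter=1000).fit(X_train, y_train)
    dev_acc = accuracy_score(y_dev, model.predict(X_dev))
    if dev_acc > best_dev_acc:
        best_C, best_dev_acc = C, dev_acc

# The test set is touched exactly once, after the choice is made,
# to check that the selected model generalizes.
final_model = LogisticRegression(C=best_C, max_iter=1000).fit(X_train, y_train)
test_acc = accuracy_score(y_test, final_model.predict(X_test))
print(f"best C={best_C}, dev acc={best_dev_acc:.3f}, test acc={test_acc:.3f}")
```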
Hope this helps.
Thanks so much, SomeshChatterjee, now everything’s clear!
Giacomo