Bad hyper parameters vs bad model

Nevermnd · December 31, 2025, 11:02pm

For the final assignment of C2W3, in the last section where we are to select hyperparameters to train our DistilBERT model, it suddenly became clear to me how tough this is, and why we were earlier trained to use Optuna for hyper parameter search given how tough it is.

So, for those more experienced, is it only an ‘art’ or does it get better with practice ?

And a grander question here: How do you know when no hyper parameters will work, or you simply have a bad model / error in the model.

I ask this in part because even with the suggested parameters I was still getting lousy loss and F1 scores (i.e. maybe there was something lousy/broken in my model that the grader just ‘missed’) ?

Any advice would be appreciated, that it is not in fact ‘conjuring Voodoo’, but actually a science.

TMosh · January 1, 2026, 1:03am

It pretty much boils down to trying every trick in your toolbox to get improved performance.

If that doesn’t help, then question whether you’re using an appropriate sort of model.

This gets much easier with experience, because your bag-of-tools will grow.

Topic		Replies	Views
Hyperparameter tuning: best-of Sequences, Time Series and Prediction week-module-4	2	563	September 26, 2023
Evaluation of models Advanced Learning Algorithms week-module-3	2	479	February 14, 2023
Identify correct parameter values NLP with Probabilistic Models week-module-4	3	389	July 28, 2023
Week 2 Lab - what parameters to use to fully fine-tune the model? (part 2.2) Generative AI with Large Language Models ai-discussions	4	99	March 11, 2025
Hyperparameter search Structuring Machine Learning Projects week-module-1 , coursera-platform	1	36	August 20, 2024

Bad hyper parameters vs bad model

Related topics