It seems to me that not all hyperparameters can be found by tuning each one individually in isolation; at least some of them depend on each other. Taking the XOR example from this course, the best learning rate will likely depend a lot on the number of hidden layers and their respective sizes.
So I was wondering whether there is a well-known approach for optimizing the hyperparameters as a set rather than in isolation. My first thought was to take a random subset of the data, train with one full set of hyperparameters at a time, compute some kind of cost (fitness) function for each set, and then use something like gradient descent on that fitness.
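To make the "optimize them as a set" part concrete, here is a minimal sketch of a joint random search over whole configurations, assuming scikit-learn is available (the course's own XOR network and training loop could be substituted for `MLPClassifier`); the fitness function here is just accuracy on the XOR data:

```python
# Joint random search over (learning rate, hidden layer sizes):
# each candidate is a whole configuration, so interactions between
# the learning rate and the architecture are explored together.
import numpy as np
from sklearn.model_selection import ParameterSampler
from sklearn.neural_network import MLPClassifier

# The XOR problem: 4 samples, 2 inputs, 1 binary output.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 1, 1, 0])

param_space = {
    "learning_rate_init": [0.001, 0.01, 0.1, 0.5],
    "hidden_layer_sizes": [(2,), (4,), (8,), (4, 4)],
}
configs = list(ParameterSampler(param_space, n_iter=10, random_state=0))

def fitness(params):
    """Score one full hyperparameter set (higher is better)."""
    model = MLPClassifier(max_iter=2000, random_state=0, **params)
    model.fit(X, y)
    return model.score(X, y)  # training accuracy; XOR has no separate test set

best = max(configs, key=fitness)
print("best configuration:", best)
```

On a real dataset the fitness would be measured on a held-out validation subset rather than the training points themselves.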
Another idea, though more time-consuming, would be to use genetic algorithms, but I'm not sure whether that is feasible or just overkill.
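For the genetic-algorithm idea, I imagine something roughly like the sketch below; the `evaluate` function is a hypothetical placeholder standing in for "train the network with these settings and return a validation score":

```python
# Rough sketch of a genetic algorithm over (learning rate, hidden units) tuples.
import random

def evaluate(lr, hidden_units):
    # Placeholder fitness: replace with real training + validation accuracy.
    # This toy version just prefers moderate learning rates and ~4 hidden units.
    return -((lr - 0.05) ** 2) - 0.01 * abs(hidden_units - 4)

def mutate(individual):
    lr, hidden_units = individual
    lr = max(1e-4, lr * random.uniform(0.5, 2.0))            # perturb learning rate
    hidden_units = max(1, hidden_units + random.choice([-1, 0, 1]))
    return (lr, hidden_units)

def crossover(a, b):
    # Mix the learning rate of one parent with the architecture of the other.
    return (a[0], b[1])

population = [(random.uniform(0.001, 0.5), random.randint(1, 8)) for _ in range(10)]

for generation in range(20):
    ranked = sorted(population, key=lambda ind: evaluate(*ind), reverse=True)
    parents = ranked[:4]                                      # keep the fittest
    children = [mutate(crossover(random.choice(parents), random.choice(parents)))
                for _ in range(len(population) - len(parents))]
    population = parents + children

best = max(population, key=lambda ind: evaluate(*ind))
print("best (learning rate, hidden units):", best)
```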