Hyperparameter tuning: best-of

At the end of the course, Laurence Moroney recommends Andrew Ng's courses to learn more about hyperparameter tuning. However, Andrew Ng has taught many courses. Do you have a specific recommendation?

Considering where the last assignment of the course left off, I feel this is another essential skill to master. I am aware of a broad set of potential ways to optimize a model, but I am missing a clear framework (and best practices) for structuring and prioritizing this task well.

Some example questions:

  • Should I first optimize the learning rate, then the batch size, then the optimizer, and so on?
  • At what stage do I focus on optimizing the NN architecture rather than the hyperparameters?
  • How can I best employ tools such as KerasTuner, and at what stage? Should I optimize one hyperparameter at a time, or several in conjunction (given their interdependencies)? For concreteness, see the rough sketch after this list.
  • How do I strike the best balance between my own sanity and hyperparameter search tools?
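
To make the third question concrete, here is a rough sketch of what I imagine a joint search over two interacting hyperparameters (learning rate and batch size) could look like with KerasTuner's BayesianOptimization. The dataset variables (x_train, y_train, x_val, y_val), the tiny model, and the trial budget are placeholders I made up, not anything from the course:

```python
from tensorflow import keras
import keras_tuner as kt

class MyHyperModel(kt.HyperModel):
    def build(self, hp):
        # Placeholder architecture; the point is the joint search space.
        model = keras.Sequential([
            keras.layers.Dense(hp.Choice("units", [32, 64, 128]),
                               activation="relu"),
            keras.layers.Dense(10, activation="softmax"),
        ])
        lr = hp.Float("lr", min_value=1e-4, max_value=1e-2, sampling="log")
        model.compile(
            optimizer=keras.optimizers.Adam(learning_rate=lr),
            loss="sparse_categorical_crossentropy",
            metrics=["accuracy"],
        )
        return model

    def fit(self, hp, model, *args, **kwargs):
        # Batch size is a training-time setting, so it is tuned here
        # rather than in build(); KerasTuner samples it per trial.
        return model.fit(
            *args,
            batch_size=hp.Choice("batch_size", [32, 64, 128]),
            **kwargs,
        )

tuner = kt.BayesianOptimization(
    MyHyperModel(),
    objective="val_accuracy",
    max_trials=20,  # illustrative budget
    directory="tuning",
    project_name="lr_and_batch_size",
)
# x_train, y_train, x_val, y_val assumed to be prepared elsewhere.
tuner.search(x_train, y_train, epochs=10, validation_data=(x_val, y_val))
best_hp = tuner.get_best_hyperparameters(1)[0]
```

Is tuning them together like this (rather than one at a time) the sensible default?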

Clearly, not everything here has a black-and-white, universally true answer. But I am sure there is a lot one could learn from others' experience and well-proven routines in this regard!

Cheers,
Steffen

Please take up the Deep Learning Specialization.


Thank you so much for sharing your suggestion! I had the opportunity to take the specialization a few years ago and found it incredibly insightful. Recently, I revisited it and explored some additional materials that have deepened my understanding even further.

After several weeks of experimentation, I found that GridSearch, Bayesian optimization using KerasTuner, and Optuna (with various samplers and pruners) are exceptionally beneficial. I hope this proves helpful to my fellow TF/ML classmates as well! :blush:
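
For anyone who wants a starting point, here is a minimal sketch of the kind of Optuna setup I mean, using a TPE sampler and a median pruner; the tiny Keras model, the data variables (x_train, y_train, x_val, y_val), and the budgets are placeholder assumptions, not a prescription:

```python
import optuna
from tensorflow import keras

def objective(trial):
    lr = trial.suggest_float("lr", 1e-4, 1e-2, log=True)
    units = trial.suggest_categorical("units", [32, 64, 128])
    model = keras.Sequential([
        keras.layers.Dense(units, activation="relu"),
        keras.layers.Dense(10, activation="softmax"),
    ])
    model.compile(
        optimizer=keras.optimizers.Adam(learning_rate=lr),
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    # Train one epoch at a time and report validation accuracy, so the
    # pruner can stop unpromising trials early.
    for epoch in range(10):
        model.fit(x_train, y_train, epochs=1, batch_size=64,
                  validation_data=(x_val, y_val), verbose=0)
        val_acc = model.evaluate(x_val, y_val, verbose=0)[1]
        trial.report(val_acc, epoch)
        if trial.should_prune():
            raise optuna.TrialPruned()
    return val_acc

study = optuna.create_study(
    direction="maximize",
    sampler=optuna.samplers.TPESampler(),
    pruner=optuna.pruners.MedianPruner(n_warmup_steps=3),
)
study.optimize(objective, n_trials=30)
print(study.best_params)
```

The pruner only pays off because intermediate values are reported per epoch via trial.report(); without that, every trial runs to completion regardless.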

Wishing everyone continued success in their learning journey!