Hey Sara,
I believe this post contains the answer to your question.
Note that
- Using a bigger model or a different architecture can be seen as a way of changing model capacity.
- A hyperparameter is any parameter that we keep fixed during the training process (it is set before training rather than learned). In that sense, the number of layers is a hyperparameter that changes model capacity, the coefficient of a norm penalty is a hyperparameter that lets us increase or decrease regularization, etc.
- The optimization algorithm definitely has an effect on learning, and its learning rate is probably the most important hyperparameter, but we do not usually treat it as an instrument for reducing bias or variance. It determines how fast we converge to some solution.
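
To make the distinction concrete, here's a minimal sketch using scikit-learn's `MLPClassifier` (just one convenient example; any framework exposes the same three knobs). The specific values below are arbitrary placeholders, not recommendations:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

# Toy data purely for illustration
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = MLPClassifier(
    hidden_layer_sizes=(64, 64),  # capacity knob: more/wider layers -> higher capacity
    alpha=1e-3,                   # regularization knob: L2 penalty coefficient
    learning_rate_init=1e-3,      # optimization knob: affects convergence speed,
                                  # not something we tune to trade bias for variance
    max_iter=500,
    random_state=0,
)
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```

If the model underfits, you'd reach for `hidden_layer_sizes` (increase capacity) or lower `alpha`; if it overfits, you'd shrink the network or raise `alpha`; `learning_rate_init` you'd adjust only if training is unstable or too slow.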