{moderator edit - quiz text and answers removed}
Shouldn’t the answer be True for this question?
Deep networks do overfit the data, which results in lower training error. Isn't that why we use residual networks, to reduce overfitting and to avoid vanishing gradients?
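
For context, here is a minimal sketch of what a residual block computes, in plain Python. It assumes a two-layer branch F(x) = W2 · relu(W1 · x); the names `residual_block`, `W1`, and `W2` are illustrative and not taken from the quiz or course material.

```python
def relu(v):
    # Element-wise ReLU on a list of floats
    return [max(0.0, a) for a in v]

def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector
    return [sum(w * a for w, a in zip(row, v)) for row in W]

def residual_block(x, W1, W2):
    # Residual branch F(x) = W2 . relu(W1 . x)  (assumed form)
    fx = matvec(W2, relu(matvec(W1, x)))
    # Skip connection: the input is added back to the branch output
    return [a + b for a, b in zip(x, fx)]

x = [1.0, -2.0, 3.0]
zeros = [[0.0] * 3 for _ in range(3)]

# With all-zero weights the branch contributes nothing,
# so the block passes its input through unchanged.
out = residual_block(x, zeros, zeros)
print(out)
```

Because of the skip connection, a block whose weights are zero passes its input through unchanged, so stacking more such blocks does not by itself have to change what the network computes on the training data.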