Hi! Does anybody know of a good resource that summarizes the state-of-the art understanding around this problem? A good article (academic or non-academic) or a survey paper maybe? Thank you! Mohammad

Understanding of local optima in deep networks

paulinpaloalto April 28, 2023, 6:44am 3

Here’s an earlier thread on this general topic that includes a link to a paper from Yann LeCun’s group on cost surfaces. Please let us know if that looks like it’s relevant for your question.

Understanding of local optima in deep networks