Another thing worth saying here is that, before you get too far down this rabbit hole, please realize that Linear Regression is by far the simplest problem mathematically that you will see here. It's actually solvable in closed form! As soon as you graduate to a neural network with more than one layer, you can say "bye bye" to convexity and closed-form solvability. It's not unusual to see neural networks with hundreds of layers and millions of parameters, so the solution surfaces are non-convex and embedded in $\mathbb{R}^n$ for some very large value of $n$.
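To make "closed form" concrete, here's a minimal sketch of the normal-equation solution $\hat{w} = (X^\top X)^{-1} X^\top y$, using NumPy and synthetic data. Everything in it (the data, the dimensions, the variable names) is purely illustrative.

```python
import numpy as np

# Synthetic data: 100 samples, 3 features, plus a little noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + rng.normal(scale=0.1, size=100)

# Append a bias column, then solve the normal equations
# (X^T X) w = X^T y directly -- no iterative optimization needed.
Xb = np.hstack([X, np.ones((100, 1))])
w_hat = np.linalg.solve(Xb.T @ Xb, Xb.T @ y)
print(w_hat)  # should land close to [2.0, -1.0, 0.5, 0.0]
```

That one `np.linalg.solve` call is the whole "training" step, which is exactly what you lose once the model stops being linear.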
Here’s a paper from Yann LeCun’s group that discusses solution surfaces for neural networks. And here’s a thread about Weight Space Symmetry and the number of potential local optima, which is more food for thought.