Neural Network Clarification

Say we are dealing with a logistic cost, which is a convex function. I am confused: even though the neurons in each layer are initialized with different parameters, how come each neuron does not end up producing the same output after gradient descent? I don’t understand why the neural network doesn’t collapse, and I feel like I am missing some intuition about how neural networks learn patterns by themselves.

Because of the non-linear activation functions in the hidden layers, the NN cost function is not convex in the weights, even though the logistic loss itself is convex in the network’s output.

Given different initial values, each weight will follow its own trajectory to a value that minimizes the cost.
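Here is a minimal NumPy sketch of that idea. The network shape, the toy XOR data, the learning rate and the step count are all made up for illustration: the same one-hidden-layer network is trained from two different random initializations, and it typically ends up with clearly different weights while reaching a similarly low cost, which also illustrates the non-convexity mentioned above.

```python
import numpy as np

def train(seed, steps=5000, lr=2.0):
    rng = np.random.default_rng(seed)
    # Toy XOR task, chosen only because it needs a hidden layer.
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
    y = np.array([[0], [1], [1], [0]], dtype=float)

    sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

    # One hidden layer of 3 sigmoid units, one sigmoid output.
    W1 = rng.normal(size=(2, 3)); b1 = np.zeros(3)
    W2 = rng.normal(size=(3, 1)); b2 = np.zeros(1)

    for _ in range(steps):
        h = sigmoid(X @ W1 + b1)           # forward pass
        p = sigmoid(h @ W2 + b2)
        dz2 = (p - y) / len(X)             # backprop of the logistic cost
        dW2, db2 = h.T @ dz2, dz2.sum(axis=0)
        dz1 = (dz2 @ W2.T) * h * (1 - h)
        dW1, db1 = X.T @ dz1, dz1.sum(axis=0)
        W1 -= lr * dW1; b1 -= lr * db1     # gradient-descent step
        W2 -= lr * dW2; b2 -= lr * db2

    cost = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
    return W1, cost

W1_a, cost_a = train(seed=0)
W1_b, cost_b = train(seed=1)
print(cost_a, cost_b)             # typically both end up small
print(np.allclose(W1_a, W1_b))    # False: different weights, similar cost
```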

It seems like magic, and I don’t have a mathematical proof. But it works.

Your intuition is right in one special circumstance: if all the weights in a layer are initialized to the same constant, the gradients with respect to each of those weights will be identical, so the neurons in that layer remain copies of each other after every update. This Stack Overflow discussion gives you some more intuition.
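A small sketch of that symmetry argument, with made-up data and layer sizes: one backprop pass through a tiny one-hidden-layer network shows that constant initialization gives every hidden unit the same gradient column, while random initialization does not.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 4))                        # toy inputs (illustrative only)
y = rng.integers(0, 2, size=(8, 1)).astype(float)  # toy binary labels

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def hidden_gradient(W1, W2):
    """One backprop pass; returns the gradient of the logistic cost wrt W1."""
    h = sigmoid(X @ W1)                # hidden activations
    p = sigmoid(h @ W2)                # output probability
    dz2 = (p - y) / len(X)             # gradient at the output pre-activation
    dz1 = (dz2 @ W2.T) * h * (1 - h)   # backprop through the hidden layer
    return X.T @ dz1                   # one column per hidden unit

# All weights set to the same constant: every hidden unit computes the same
# activation, so every column of the gradient is identical and the units
# stay copies of each other after every update.
g = hidden_gradient(np.full((4, 3), 0.5), np.full((3, 1), 0.5))
print(np.allclose(g[:, 0:1], g))       # True

# Random initialization breaks the symmetry: the columns differ.
g = hidden_gradient(rng.normal(size=(4, 3)), rng.normal(size=(3, 1)))
print(np.allclose(g[:, 0:1], g))       # False
```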