Hi all, would be great if someone can help me with my understanding on the 2,2,1 Neural Network on slide 503 of the PDF:
Currently I do not understand why a1 and a2 should come to different results as - from my understanding
- both have the same inputs x1, x2 and b
- both have the same number of iterations for the optimization and use the same algorithm
- both use the same starting points for the optimizations
- So my assumption would be that w11 = w12, w21 = w22 and b1=b2 after the optimization thus I would expect a1 = a2
If this would be correct, I currently do not understand why we should not go with a 2,1 Network.
Would be great if someone could help me what I am missing here. Thx for your help!