In the optional coffee roasting lab, a high output value from a layer 1 unit corresponds to a bad roast region.
But I think this output should represent the probability of y = 1, since layer 1 also uses the sigmoid activation function. So if the output is high, the probability of a good roast should also be high.
Kindly correct me if I am wrong or missing something.
Using sigmoid in layer 1 is not sufficient for its output to represent the probability of y = 1.
The output layer has this interpretation because that is what the objective function optimizes for: the loss compares only the final output against the label y.
The objective function does not force any hidden layer to share the same interpretation, so a hidden unit is free to produce a high value in a bad roast region.
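To make this concrete, here is a minimal NumPy sketch. It is not the lab's network: the single input feature, the weights, and the names `W1`, `b1`, `W2`, `b2`, and `forward` are all hand-picked assumptions for illustration. Each hidden sigmoid unit fires on a "bad" input, and the output layer uses negative weights to turn those high hidden values into a low probability of y = 1.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical, hand-picked weights (NOT taken from the lab).
# One input feature x; the "good" region is assumed to be roughly -1 < x < 1.
W1 = np.array([[-10.0, 10.0]])     # shape (1, 2): unit 0 fires for x << -1, unit 1 for x >> 1
b1 = np.array([-10.0, -10.0])
W2 = np.array([[-10.0], [-10.0]])  # shape (2, 1): negative weights, so a firing hidden unit lowers the output
b2 = np.array([5.0])

def forward(x):
    a1 = sigmoid(x @ W1 + b1)   # hidden layer activations, each in (0, 1)
    a2 = sigmoid(a1 @ W2 + b2)  # output layer: the only value the loss compares with y
    return a1, a2

# Row 0 is a "good" input (x = 0), row 1 is a "bad" input (x = 2).
x = np.array([[0.0], [2.0]])
a1, a2 = forward(x)

print("hidden activations:\n", a1)
print("output P(y=1):\n", a2)
```

For the bad input, the second hidden unit is close to 1 while the final output is close to 0. Both layers use sigmoid, but only the final output is tied to the label by the loss, so only it can be read as the probability of a good roast.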
@rmwkwok thank you very much for the clarification.
You are welcome @sandeep_kumar13