OK, I see it now.
A bit of forensic investigation always helps.
So the “LINEAR->RELU” layer first performs the affine operation (WX + b) and then applies the ReLU activation function.
In this case I would suggest slightly changing the notation to “(LINEAR->RELU)->(LINEAR->RELU)->(LINEAR->SIGMOID)” or “(LINEAR | RELU) → (LINEAR | RELU) → (LINEAR | SIGMOID)”, so that it is clear each parenthesized group is a single layer.
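
For anyone else who gets confused by this, here is a rough NumPy sketch of how I now read the grouped notation: each parenthesized group is one layer that computes WX + b and then applies its activation. The function names (`linear`, `relu`, `sigmoid`, `forward`), the layer sizes, and the random weights are all made up by me for illustration; this is not the course's actual code.

```python
import numpy as np

def linear(W, X, b):
    # Affine step of a layer: WX + b (b broadcasts across the columns of X).
    return W @ X + b

def relu(Z):
    # Element-wise max(0, z).
    return np.maximum(0, Z)

def sigmoid(Z):
    # Element-wise logistic function.
    return 1 / (1 + np.exp(-Z))

def forward(X, params):
    # (LINEAR->RELU) for every layer except the last one...
    A = X
    for W, b in params[:-1]:
        A = relu(linear(W, A, b))
    # ...then (LINEAR->SIGMOID) for the output layer.
    W, b = params[-1]
    return sigmoid(linear(W, A, b))

# Hypothetical sizes: 4 input features, 5 examples, layers of 3, 2 and 1 units.
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 5))
params = [
    (rng.standard_normal((3, 4)), np.zeros((3, 1))),
    (rng.standard_normal((2, 3)), np.zeros((2, 1))),
    (rng.standard_normal((1, 2)), np.zeros((1, 1))),
]
Y = forward(X, params)
print(Y.shape)
```

With three (W, b) pairs this is exactly (LINEAR->RELU)->(LINEAR->RELU)->(LINEAR->SIGMOID): two hidden layers followed by a sigmoid output layer.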