WHAT DOES THESE LINES DESCRIBE
So this big neural network doesn’t do anything
that you can’t also do with logistic regression.
That’s why a common rule of thumb is don’t use
the linear activation function
in the hidden layers of the neural network