I found Geoffrey Hinton’s comment near the end of the interview interesting when he stated what is modeling (not an exact quote)?

- Apply linear transformations to your measurements.
- Get to the desired state vector where only linear transformations need to be applied.

Is this the principle by which Neural Networks were created. WX + b is any linear transformation. And the loss function helps you adjust the weights (W) to get to the desired state vector?

Just trying to get more clarity on conceptually what the math is doing for us.

– Sam