We can do it whichever way we want, as long as all the maths are correct, but unless it comes to an assignment and the autograder has some requirement about it.
As for “standard”, I think it’s more about who and what tool you are working with. Tensorflow arranges weights in a matrix of shape (number of neurons(features) in the last layer, number of neurons in this layer).
Cheers,
Raymond