I’m currently doing the first assignment of this course (the rnn_cell_forward function so far), and i’m just wondering on what is the optimal number of weight parameters? I don’t remember Andrew talking about the number of weight parameters, but similar to CNNs, I’m guessing this has already been studied by a number of researchers. Anyone knows the number?
As with any NN, the number of weight parameters is a compromise between the quality of the results and the amount of time (and computation) required for training.
There is no specific answer to that question. You didn’t say how many input features you have, for example.
One rule-of-thumb is to start with the square root of the number of input features. Then you have to adjust either higher or lower depending on your specific goals (accuracy vs. training time).