Determine size of n_a

zhiyong9654867 · March 10, 2022, 1:11am

Hi,

I was working through week 1’s programming exercises and I realized I wasn’t sure how n_a is determined. The size of activations for RNNs is defined as (𝑛𝑎,𝑚,𝑇𝑥) but how is na determined?

paulinpaloalto · March 10, 2022, 2:48am

The size of the activations is a “hyperparameter”, meaning that it is simply a choice you need to make as the system designer. You need to choose a value that captures the complexity of the “state” that you need to track in the nodes of your RNN. The way such choices are made is by experience and intuition. Then you check whether your choices are good by the performance of your resulting trained model. If you choose too small a value, the model may not perform very well (“underfitting”). If you set it too large, then it is more costly to train your network.

Topic		Replies	Views
W1 A1 dimensions of n_a and n_y? Sequence Models week-1	2	7	December 13, 2024
W1A2 - How Are Shapes Determined Sequence Models week-1	1	122	May 25, 2024
RNN dimensions of activation a0 Sequence Models	1	527	August 26, 2021
I have some fundamental questions Sequence Models	3	535	October 1, 2021
How n_a is decided for a_next Sequence Models	1	503	November 4, 2021

Determine size of n_a

Related topics