Understanding number of parameters in an RNN

I have written a very simple RNN and am trying to understand how Keras arrives at the number of parameters. The code reports 24 learnable parameters for layer l1, but I was expecting 100. My understanding was that the RNN concatenates the hidden state (length 4 in this case) with the input vector (length 20 in this case) to give a combined input of size 24; with 4 hidden units that should give 24 x 4 weights plus 4 biases = 100 parameters in total for layer l1. But on running the code below I get only 24 parameters for layer l1, as if the input vector were of size 1. What am I missing here? Isn't it supposed to have a different weight for each step in my time-series window of size 20, rather than a single weight for the entire window?

import tensorflow as tf

l0 = tf.keras.layers.Input(shape=(20, 1))
l1 = tf.keras.layers.SimpleRNN(4)
l2 = tf.keras.layers.Dense(1)

model = tf.keras.models.Sequential([l0, l1, l2])

# Print the model summary
model.summary()

My expected count was: (20 features + 4 hidden inputs) x 4 hidden units + 4 biases = 24 x 4 + 4 = 100

This post is in the wrong topic. Mentors don’t monitor the “General Discussion” forum very often.

I’ve moved this to DLS Course 5 forum, as that seems to be the matching topic.

An RNN layer's weights are shared across all the timesteps of an input, which is why training it is called backpropagation through time (BPTT). Your input shape (20, 1) means 20 timesteps with 1 feature each, so the input size per timestep is 1, not 20. In DLS notation, the parameter shapes for your model are:
W_{ax} = (1, 4) # (input.shape[-1], num_hidden_units)
W_{aa} = (4, 4) # (num_hidden_units, num_hidden_units)
b_a = (4, ) # (num_hidden_units, )

The total number of parameters is 4 + 16 + 4 = 24, which matches what model.summary() reports for layer l1.
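To make the arithmetic explicit, here is a small sketch of the count (plain Python; `simple_rnn_params` is a hypothetical helper name, not a Keras API):

```python
def simple_rnn_params(features, units):
    # W_ax (features x units) + W_aa (units x units) + b_a (units)
    return features * units + units * units + units

# Input shape (20, 1): 20 timesteps, 1 feature per timestep
print(simple_rnn_params(1, 4))   # 24 -- what Keras reports
# The 100 the question expected would need 20 features per timestep:
print(simple_rnn_params(20, 4))  # 100
```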

See this link for SimpleRNNCell
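To see the weight sharing concretely, here is a minimal NumPy sketch of the SimpleRNN recurrence (random weights for illustration only): the same three parameter arrays are reused at every one of the 20 timesteps, so the window length never enters the parameter count.

```python
import numpy as np

rng = np.random.default_rng(0)
T, features, units = 20, 1, 4   # 20 timesteps, 1 feature, 4 hidden units

x = rng.standard_normal((T, features))        # one input sequence
W_ax = rng.standard_normal((features, units)) # input-to-hidden weights
W_aa = rng.standard_normal((units, units))    # hidden-to-hidden weights
b_a = np.zeros(units)                         # bias

a = np.zeros(units)              # hidden state
for t in range(T):               # the SAME weights are reused at every step
    a = np.tanh(x[t] @ W_ax + a @ W_aa + b_a)

print(W_ax.size + W_aa.size + b_a.size)  # 24 learnable parameters
```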