I was wondering why for RNNs we reuse the weights and biases for each time step, and not have different weights for different steps. Thank you!
Also, why is there no week 1 tag?
Hi @mishoo8
In RNNs, weights and biases are reused for parameter sharing, which helps the model generalize patterns across different positions in the sequence, maintain temporal consistency, and reduce computational cost by keeping the number of parameters small.
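As a rough sketch of what "reusing the same weights at every step" means (the names `W_ax`, `W_aa`, `b_a` are just the common course notation, not any particular library's API):

```python
import numpy as np

def rnn_forward(x_seq, W_ax, W_aa, b_a):
    """Run a simple RNN over a sequence, applying the SAME
    weights W_ax, W_aa and bias b_a at every time step."""
    n_a = W_aa.shape[0]
    a = np.zeros(n_a)          # initial hidden state a<0>
    for x_t in x_seq:          # one iteration per time step
        # the same parameters are reused here (parameter sharing)
        a = np.tanh(W_ax @ x_t + W_aa @ a + b_a)
    return a

# tiny example: 5 time steps, 3 input features, 4 hidden units
rng = np.random.default_rng(0)
x_seq = [rng.standard_normal(3) for _ in range(5)]
a_final = rnn_forward(x_seq,
                      rng.standard_normal((4, 3)),   # W_ax
                      rng.standard_normal((4, 4)),   # W_aa
                      rng.standard_normal(4))        # b_a
print(a_final.shape)  # (4,)
```

Note that the loop body never indexes the weights by `t`; that is the whole point of sharing.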
Hope this helps, feel free to ask if you need further assistance!
Hello, @mishoo8,
If we had different weights for different time steps, then it would not be a "recurrent" neural network; it would just be a normal neural network that treats each timestep as a feature. In that case, as @Alireza_Saei explained, there would be a lot of parameters if we have many time steps. We would also lose the temporal order, because the timesteps would be flattened out into features "that stand side-by-side with each other".
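To make the parameter-count point concrete, here is an illustrative back-of-the-envelope calculation (the sizes are made up, not from the course):

```python
# Hidden state of size n_a, input of size n_x, T time steps.
n_a, n_x, T = 100, 50, 1000

# Shared weights: one W_ax (n_a x n_x), one W_aa (n_a x n_a),
# one bias b_a (n_a), reused at every step.
shared = n_a * n_x + n_a * n_a + n_a

# Separate weights per time step: T independent copies of the same matrices.
per_step = T * shared

print(shared)    # 15100
print(per_step)  # 15100000
```

A 1000-step sequence multiplies the parameter count a thousandfold, and none of those extra parameters can learn from any timestep other than their own.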
Cheers,
Raymond
Wouldn't such a network (an RNN with different weights each timestep) still be different from a regular NN? I am having a bit of difficulty trying to understand what makes RNNs capable of leveraging sequences of data compared to regular neural networks, since in both cases activations get passed in all inner units of the networks. Is it that for cells in the first layer, the only input in a NN is just the input X, while for RNNs it is both the input X and the previous activations?
@mishoo8 the way I like to think of it is that we are essentially tuning the weights across time. As @rmwkwok points out, if we were not doing this, it would be, as he says, as if "each timestep is a feature".
I mean, if you think about it, though not exactly (and thus I hesitate to use the word "temporal" here)-- but even traditional neural nets have a "spatial" flow of dimensions. The weights of the following layer always depend on the outcomes of all those that came before it-- but in that case we are kind of picking apart or further segmenting the data into certain compartments based upon layers.
In this sense, the layers in a traditional NN are sort of "linearly separable" from one another in a sense, but in an RNN we are more so trying to determine the total equation for an operation as a flow through time.
Again-- I am not saying this is even what an RNN is actually doing, and unfortunately I could not find a YouTube video or tutorial that I thought was great as an example. But consider the Fourier analysis of the breakdown of an audio signal. We get closer and closer to simulating the original signal by adding additional components to our Fourier transform equation, and this is kind of what each step in the RNN is doing-- "tweaking" our weights to add/refine additional component parts-- yet this is still all one single signal, not many signals, thus we have only one set of weights (i.e. only one Fourier equation based on the data).
Hello, @mishoo8,
This is the point you need to prove, isn't it? (We can't ask someone who does not believe something to prove it to be true, can we?) Below is how I disprove it, and you need to show your attempt:
This is what one neuron in a "regular" NN does:
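(The original post showed this as an image, which is not reproduced here; assuming inputs $x_1, \dots, x_n$ with weights $w_1, \dots, w_n$, bias $b$, and activation $g$, the computation is roughly:)

```latex
a = g\left(\sum_{i=1}^{n} w_i x_i + b\right)
```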
This is what a neuron that gives each timestep a different weight does, assuming there is only one feature per timestep:
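(Again, the original image is not reproduced here; writing $x^{\langle t \rangle}$ for the single feature at timestep $t$, with one weight $w_t$ per timestep, the computation is roughly:)

```latex
a = g\left(\sum_{t=1}^{T} w_t\, x^{\langle t \rangle} + b\right)
```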
How do they look any different?