Coffe Roasting in TensorFlow Lab - initial weights?

Svetlana_Verthein · January 26, 2023, 6:20pm

Hello
In this lab C2_W1_Lab02_CoffeeRoasting_TF we have:

Dense performs a sigmoid activation in each layer, which requires weights W and b.
Where do we get the initial weights in the first Dense call here, to work on input x/Layer 0?
Previously we used to initialize weights to some random numbers, but I don’t see here where we initialize the weights here?

Thank you!

gent.spah · January 26, 2023, 6:30pm

Good question. In the tensorflow site for the Dense class of layers:

There are a few selections on kernel (weights) and bias and I think there is one in default if not specified further.

Svetlana_Verthein · January 26, 2023, 6:38pm

thank you, that makes sense.

But then for the second Dense we don’t specify the Kernel either, so by default, it seems to me, it would be the same default Kernel as in the first Dense, but we should be using the one created by the first Dense?
Is Sequential somehow knows to pass Kernel updated in one Dense to the next Dense?
Thank you!

gent.spah · January 26, 2023, 6:47pm

Each layer maybe initialized with similar method like “glorot uniform” but those weights and biases evolve differently for each layer, so I am not sure I understand what you mean by pass the updated kernel to the next layer. Each layer has its own set of weights.

Svetlana_Verthein · January 26, 2023, 6:57pm

Layer 1 uses Kernel from Layer 0 ( kernel_initializer=‘glorot_uniform’).

Layer 2 uses Kernel created by Layer 1.
But in second Dense call we don’t specify that kernel_initializer = ‘Layer1_kernel’ (not sure about the correct syntax, just trying to convey that it is a Kernel generated by Layer 1). So, if we don’t specify kernel_initializer, by default second Dense should also use kernel_initializer=‘glorot_uniform’, correct?
But it doesn’t, it uses the correct Kernel (from Layer 1). So how does Layer 2 knows how to use the correct Kernel if we don’t specify it in the call?
Does the question make sense?
Thank you

TMosh · January 26, 2023, 7:57pm

From reading the TF “Dense” documentation, it looks to me like any Dense layer uses glorot_uniform to initialize the weights, unless you specify something else.

TMosh · January 26, 2023, 7:58pm

Can you post some data that shows this?

Svetlana_Verthein · January 26, 2023, 8:41pm

No data - but from the lectures Layer 2 uses weights (kernel) from Layer 1, not the default ’ ‘glorot_uniform’

So, how do we pass weights (Kernel) from Layer 1 to Layer 2 if we don’t specify it in the second Dense call (in which case Dense is supposed to use the default, ‘glorot_uniform, instead of weights from Layer 1)?
I’m sorry if I am not being clear…

In other words, which weights second Dense uses and how does it gets them?

rmwkwok · January 27, 2023, 2:15am

Hello @Svetlana_Verthein,

In the screenshot from your first post, there are 2 calls to tf.keras.layers.Dense. As @gent.spah pointed out, a tf.keras.layers.Dense uses 'glorot_uniform' by default to initialize the weights. Therefore,

both the Dense will be initialized using the same method 'glorot_uniform'
since one Dense will be initialized after the other Dense, even though they both use the same method, their initialized weights will be different
the weights of both Dense are initialized according to 'glorot_uniform', and it is NOT true that the second Dense will wait for the first Dense to pass its weights. There is no weight passing through layers in weight initialization.

Therefore,

Tensorflow does not pass weights from Layer 1 to Layer 2.

By calling 'glorot_uniform' which can initialize the weights randomly. This is true for both the first and the second Dense.

Could you please share the source of this? Perhaps the video name and timestamp, or which lab and which section of the lab?

Raymond

Svetlana_Verthein · January 27, 2023, 7:37pm

Thank you, Raymond, everything is clear to me now. I see where I was confused - for some reason I thought we were using not just a values, but also weights from the previous layer - now I see i was obviosuly wrong, and the lectures don’t say that.
I really appreciate your patience and clarity of explanation!

rmwkwok · January 28, 2023, 1:57am

You are very welcome, @Svetlana_Verthein!

Cheers,
Raymond

Basira_Daqiq · July 2, 2023, 3:39pm

I had a similar question: how are the initial weights for each layer determined? This thread helped. Thanks!

Topic		Replies	Views
Weight Initalization AI Discussions ai-discussions	14	338	October 8, 2024
Week 1, Optional Lab 02: Question about TensorFlow Neural Networks and Deep Learning week-1	10	44	August 19, 2024
Question about Hidden Layers Advanced Learning Algorithms week-1	2	389	July 25, 2023
Layer 2 Weights Advanced Learning Algorithms week-1	6	349	September 8, 2023
C2_W1_Assignment - Question on Line of Code Advanced Learning Algorithms week-2	3	505	January 17, 2023

Coffe Roasting in TensorFlow Lab - initial weights?

Related topics