Weight sets - LSTM - Siamese

Nevermnd · August 1, 2024, 9:45pm

@Deepti_Prasad perhaps you can help me on this as I know you are one of the few NLP mentors.

I get the point they make that the weights are ‘shared’, but does this happen when we are still in the dual LSTM network stage, or are they saying we do this once we go to the cosine similarity juncture ?

I’m not quite at the assignment yet but I am finding this really confusing… I’d imagine you’d share weights at the LSTM stage, but have no idea how you’d do that.

What are they talking about ?

gent.spah · August 2, 2024, 2:48pm

We had this question here before, have a look in this post:

As far as I remember its one base network shared with the inputs!

Nevermnd · August 2, 2024, 3:02pm

Thank you for letting me know/reminding me.

Perhaps I just wasn’t sure what question I should ask.

Nevermnd · August 3, 2024, 9:44am

Just as I work through this… I (think ?) there is a mistake in the notebook:

Or in the args list it cites d_model as a default of 128… But I am presuming they mean ‘d_feature’… Or otherwise I’m not sure where this value is coming from…

gent.spah · August 3, 2024, 10:21am

I think you are right here! Let me try and leave a note on the repo!

Topic		Replies	Views
C3_W4 UNQ_C5 : problem with loading the weights NLP with Sequence Models week-4	10	741	October 25, 2023
How is backpropagation implemented in siamese networks NLP with Sequence Models week-4	1	527	August 28, 2023
I presume we still have a bias term-- NLP with Sequence Models week-1	3	13	July 25, 2024
Backpropagation in RNN weight sharing Sequence Models	4	818	February 23, 2022
Assignment 3: Question duplicates_Exercise 01 _siamese NLP with Sequence Models week-3	5	33	November 6, 2024

Weight sets - LSTM - Siamese

Related topics