C1_W4_Final_Assignment There was a problem with data included

duongthuytrinh2004 · February 18, 2025, 3:51am

Hello, there was a problem with this embedding dataset. When I tried to complete these part below ( test_get_document_embedding and test_get_document_vecs), it keeps announcing problem relating to this dataset.

. I tried to do everything but can not fix it

TMosh · February 18, 2025, 4:00am

It appears the problem is that your get_document_embedding() function does not work correctly.

I say this because the function that is throwing the error is one that tests your code.

paulinpaloalto · February 18, 2025, 4:31am

Yes, as Tom says, this is a bug in your function. Your logic is assuming that every word has an embedding in the dictionary, but that is not correct assumption. Your code needs to handle both cases: with and without an embedding for a given word.

They said this in the instructions for that function:

Document Embeddings

Document embedding is created by summing up the embeddings of all words in the document.
If we don’t know the embedding of some word, we can ignore that word.

duongthuytrinh2004 · February 18, 2025, 5:51am

Yes I tried to change the logic so that it could handle both cases. But it turns out incorrect result when I run final test. I will show you my result below:

And this is my fixed function:

{moderator edit - solution code removed}

paulinpaloalto · February 18, 2025, 5:54am

It is not sufficient just to use “split()” on the tweet. You just wrote a function called process_tweet, right?

duongthuytrinh2004 · February 18, 2025, 5:55am

Omg right, thank you so much, I would do it again

paulinpaloalto · February 18, 2025, 5:56am

Sorry, I was wrong: you didn’t write that function, but it is imported for you and you’ve used it before in this assignment, right?

duongthuytrinh2004 · February 18, 2025, 6:00am

Yes it was imported in the first code block for me. I have achieved All test passed, thank for your dedication sir!

Topic		Replies	Views
C1_W4_Assignment function get_document_embedding embedding input issue NLP with Classification and Vector Spaces notebook , week-4	2	316	January 29, 2024
C1_W4_Assignment: Exercise 7. getting 'key error' NLP with Classification and Vector Spaces week-4	4	195	April 19, 2024
Week 4 assignment Exercise 7 NLP with Classification and Vector Spaces week-4	2	507	April 6, 2023
Exercise 7: get_document_embedding wrong result NLP with Classification and Vector Spaces week-4	2	455	June 17, 2023
C1_W4_Assignment: Problems in grading the submissions NLP with Classification and Vector Spaces week-4	6	547	December 14, 2022

C1_W4_Final_Assignment There was a problem with data included

Document Embeddings

Related topics