I no longer have access to the notebook now that I've completed the certificate. But according to the copy I snapshotted last year, it doesn't seem that attention_mask was ever used in training the model:
the TF training dataset has only input_ids. However, I recently found a Hugging Face notebook on training NER, and it uses the same DistilBERT model but does pass attention_mask in the input. In fact, the TF dataset input there is a dict with both input_ids and attention_mask. Using the attention mask makes sense, since you don't want to build representations that attend to padding tokens.
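For reference, here is roughly what that looks like (a minimal sketch from memory of the Hugging Face tutorial, not the course notebook; the checkpoint name and example sentences are my own):

```python
import tensorflow as tf
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

sentences = ["EU rejects German call", "Peter Blackburn"]
enc = tokenizer(sentences, padding="max_length", truncation=True,
                max_length=16, return_tensors="np")

# The input is a dict carrying BOTH input_ids and attention_mask,
# so self-attention can zero out the padding positions.
dataset = tf.data.Dataset.from_tensor_slices({
    "input_ids": enc["input_ids"],
    "attention_mask": enc["attention_mask"],
}).batch(2)

for batch in dataset.take(1):
    print(batch["input_ids"].shape, batch["attention_mask"].shape)
```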
So I wonder why the notebook in this specialization never uses the attention mask?
@TMosh Thanks. I wonder whether these optional notebooks are also updated frequently. I copied mine locally last year after completing the specialization, so maybe there's a chance they have fixed it since?
I also ran into another issue with this notebook, concerning metrics=['accuracy']; I will post a new thread on it.
I am actually trying out different ways to train NER for my own project, so I'm looking at this notebook more closely than I otherwise would. I strongly suspect the course instructors/TAs based it on Hugging Face's own tutorial (maybe an older version), with their own variation on how the dataset is prepared. I found some of the code and ideas useful.
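For comparison, the label-alignment step in the current Hugging Face token-classification tutorial looks roughly like this (my reconstruction, not the course's parsing functions; the word_ids() call requires a fast tokenizer, and -100 is the label id the standard loss ignores):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

def tokenize_and_align_labels(words, word_labels, max_length=16):
    enc = tokenizer(words, is_split_into_words=True,
                    padding="max_length", truncation=True,
                    max_length=max_length)
    labels = []
    previous_word_id = None
    for word_id in enc.word_ids():
        if word_id is None:                 # special token or padding
            labels.append(-100)
        elif word_id != previous_word_id:   # first subword of a word
            labels.append(word_labels[word_id])
        else:                               # later subword: ignore it
            labels.append(-100)
        previous_word_id = word_id
    enc["labels"] = labels
    return enc

example = tokenize_and_align_labels(
    ["EU", "rejects", "German", "call"], [3, 0, 7, 0])
print(example["labels"])
```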
But the ungraded lab notebook is also thin on explanation, and I suspect those long, winding parsing functions have bugs... I think the last week of Course 5 has had its fair share of complaints in the past; hopefully it will improve in the future.