Typo error in code - L1: NLP tasks with a simple interface

vsrinivas · September 13, 2023, 4:14pm

In this lesson, the code under section " Adding a helper function to merge tokens" is missing to append the last_token variable to merged_tokens list (in fact reassign new value for the last token of the list).

I believe that part of code shalle be ,

last_token = merged_tokens[-1]
last_token['word'] += token['word'].replace('##', '')
last_token['end'] = token['end']
last_token['score'] = (last_token['score'] + token['score']) / 2
# Missing code
merged_tokens[-1] = last_token

I have made the following changes for better result of mergers

if (merged_tokens and token['word'].startswith('##')) or (merged_tokens and token['entity'].startswith('I-') and merged_tokens[-1]['entity'].endswith(token['entity'][2:])):
            last_token = merged_tokens[-1]
            last_token['word'] += token['word'].replace('##', '')
            last_token['end'] = token['end']
            last_token['score'] = (last_token['score'] + token['score']) / 2
            merged_tokens[-1] = last_token

Hope the above helps.

Topic		Replies	Views
Error running NER merge_tokens code in Colab Building Generative AI applications with Gradio	0	121	October 26, 2023
Problem with Ex 7 and 10 in the final assignment NLP with Probabilistic Models week-3	7	844	September 20, 2022
C2W3 UNQ_C7 unittests failing NLP with Probabilistic Models week-3	5	425	July 20, 2023
Exercise 7 - preprocess_data is failing NLP with Probabilistic Models week-3	10	563	March 28, 2023
C2W3 Assignment Exercise 7, 8, and 10 NLP with Probabilistic Models week-3	6	283	May 9, 2024

Typo error in code - L1: NLP tasks with a simple interface

Related topics