C3W2 Train accuracy high but validation accuracy very low

When creating a post, please add:

  • Week # must be added in the tags option of the post.
  • Link to the classroom item you are referring to: C3W2_Assignment.ipynb

In the final C3W2 assignment, I have a model that passes all the tests, but when I run fit, the training accuracy reaches 0.99 while val_accuracy stays around 0.2. I tried many variations of the model's parameters and different optimizers (‘adam’, ‘rmsprop’), but the results do not change much. Does anyone have suggestions, or has anyone seen similar results?

With training accuracy reaching 0.99 but validation accuracy stuck around 0.2, your model is overfitting, which often points to a problem with how the data is split or labeled rather than with the model itself.

Did you check that the labels and text are being correctly split and assigned to the dataset?

If I had this issue, I would start from the first graded function and check whether the code is written correctly.

Also make sure your previous test cells have passed as expected.

Thank you for your quick reply. I checked each test cell; they all produce the expected results (or pass). I’ll re-examine my code.

I’m confused about why I can’t use
vectorized_text = text_vectorizer(text_only)
where text_only is split out from the data passed into preprocess_dataset(), and instead have to use
vectorized_text = text_only.map(text_vectorizer)
The first gives me an error, but I suspect the second does not do what I need. Am I missing something here?
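For what it’s worth, the error described above is expected behavior: a Keras layer can only be called on tensors, while `text_only` is a `tf.data.Dataset`, which has to be transformed with `.map()`. A minimal sketch with toy data (the variable names mirror the discussion, but the data is made up):

```python
import tensorflow as tf

# Toy data standing in for the assignment's dataset (illustrative only).
texts = ["deep learning", "tf data pipelines", "text vectorization"]
text_only = tf.data.Dataset.from_tensor_slices(texts)

text_vectorizer = tf.keras.layers.TextVectorization()
text_vectorizer.adapt(texts)

# Calling the layer directly on a Dataset object fails, because a Keras
# layer expects tensors, not a tf.data.Dataset:
# text_vectorizer(text_only)  # raises an error

# Mapping applies the layer to each element inside the pipeline instead:
vectorized_text = text_only.map(text_vectorizer)
for v in vectorized_text.take(1):
    print(v)
```

`.map(text_vectorizer)` applies the layer lazily to every element of the pipeline, which is exactly what the direct call cannot do.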

OK, your message is a bit confusing. Can you first tell me which one you used, and in which graded function cell?

check this thread

Also make sure your model follows the architecture instructions given in the assignment, such as the number of layers and the dense output layer.

Also, confirm which loss you used: categorical crossentropy or sparse categorical crossentropy?

Remember, here the classes are encoded as integer labels, so sparse would be the choice, right?
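To illustrate the distinction: `SparseCategoricalCrossentropy` expects integer class ids (the kind of labels a `StringLookup` encoder produces), while `CategoricalCrossentropy` expects one-hot vectors. A toy comparison (all values made up):

```python
import tensorflow as tf

# Labels as integer class ids (e.g. the output of a StringLookup encoder).
int_labels = tf.constant([0, 2, 1])
logits = tf.constant([[2.0, 0.5, 0.1],
                      [0.2, 0.3, 3.0],
                      [0.1, 2.5, 0.4]])

# Integer labels -> SparseCategoricalCrossentropy:
sparse_loss = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
print(float(sparse_loss(int_labels, logits)))

# One-hot labels would need plain CategoricalCrossentropy instead:
one_hot = tf.one_hot(int_labels, depth=3)
cat_loss = tf.keras.losses.CategoricalCrossentropy(from_logits=True)
print(float(cat_loss(one_hot, logits)))

# Both compute the same mean loss; only the expected label format differs.
```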

I’m using sparse categorical crossentropy.
I was asking about preprocess_dataset(), where a dataset, a text_vectorizer, and a label_encoder are passed in. First I extract text_only and label_only using map(). Then I’m supposed to apply the text_vectorizer to text_only and the label_encoder to label_only.

In the preprocess step you are supposed to apply that transformation only once; the next step is for setting the number of batches, not for processing the text and labels again.

Go through the comment I linked and see if that helps; otherwise, send me screenshots of your code by personal DM.

I went through your comments in this and the other thread, and tried a few things, still about the same results. Sent screenshots by personal DM.

You seem to have completely mixed up the code.

In the train/val dataset function, the first step was to compute train_size using the TRAINING SPLIT; then you were supposed to split the text and labels, further dividing them into training text, training labels, validation text, and validation labels.

Then create the training dataset and validation dataset from the respective text and labels.
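A hedged sketch of the splitting pattern described above (the constant name `TRAINING_SPLIT` and all data are illustrative, not the assignment’s actual solution):

```python
import tensorflow as tf

TRAINING_SPLIT = 0.8  # assumed constant name; check the assignment's instructions

# Toy stand-ins for the assignment's parsed text and labels.
texts = ["t0", "t1", "t2", "t3", "t4", "t5", "t6", "t7", "t8", "t9"]
labels = ["sci", "biz", "sci", "biz", "sci", "biz", "sci", "biz", "sci", "biz"]

# 1. Compute the training set size from the split fraction.
train_size = int(len(texts) * TRAINING_SPLIT)

# 2. Split texts and labels into training and validation portions.
train_texts, train_labels = texts[:train_size], labels[:train_size]
val_texts, val_labels = texts[train_size:], labels[train_size:]

# 3. Build a dataset from each (text, label) pair.
train_dataset = tf.data.Dataset.from_tensor_slices((train_texts, train_labels))
validation_dataset = tf.data.Dataset.from_tensor_slices((val_texts, val_labels))

print(len(train_dataset), len(validation_dataset))  # 8 2
```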

In the preprocess function too, you have added your own code. That cell had only two code lines mentioning the dataset: the first creates the transformed dataset, and the second creates the batches.

Please get a fresh copy and redo your assignment. Make sure you only write code as per the instructions already given.

@houyao

In create_model, just below the graded function header, you seem to have added an import of tf.keras.regularizer.

Please do not add, delete, or edit anything outside the ### START CODE HERE and ### END CODE HERE markers, or your submission will fail even if your code is correct.

Thanks. I’ll make some changes following your instructions.

Get a fresh copy and then redo it; too many changes have been made.

Thanks. I got it working with a fresh copy. Thank you for your help, though I’m not exactly sure what the key point was that made my original code fail.

According to the recent code you shared, you are still hard-coding the dataset.

Here is a screenshot of the graded function without the code.

You do not need to assign the text and labels to the dataset again, as that was already done.

Also, you do not need to separately apply the text and label transformations and then use the results in the dataset afterwards.

You are supposed to define the dataset using the text and labels with their respective transformations in one single step.

The next dataset line handles the batch size.
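The two-step pattern described above can be sketched as follows; every name and all the toy data here are illustrative only, not the assignment’s actual code:

```python
import tensorflow as tf

# Toy stand-ins for the assignment's layers and data (illustrative only).
texts = ["good movie", "bad plot", "great acting"]
labels = ["pos", "neg", "pos"]

text_vectorizer = tf.keras.layers.TextVectorization(output_sequence_length=3)
text_vectorizer.adapt(texts)
label_encoder = tf.keras.layers.StringLookup(vocabulary=["pos", "neg"],
                                             num_oov_indices=0)

dataset = tf.data.Dataset.from_tensor_slices((texts, labels))

# Step 1: apply both transformations to each (text, label) pair in ONE map call.
dataset = dataset.map(lambda text, label: (text_vectorizer(text),
                                           label_encoder(label)))

# Step 2: the second line only batches the already-transformed dataset.
dataset = dataset.batch(2)

for vec, lab in dataset.take(1):
    print(vec.shape, lab.numpy())
```

Fixing `output_sequence_length` pads every vectorized text to the same length, so the subsequent `batch()` call can stack them.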

Thanks for mentioning this. I have fixed the issue with your instructions by getting a new copy of the workbook.

  • Houyao Pan

Here is my last version of preprocess_dataset():

```python
# GRADED FUNCTION: preprocess_dataset

def preprocess_dataset(dataset, text_vectorizer, label_encoder):
    """Apply the preprocessing to a dataset

    Args:
        dataset (tf.data.Dataset): dataset to preprocess
        text_vectorizer (tf.keras.layers.TextVectorization): text vectorizer
        label_encoder (tf.keras.layers.StringLookup): label encoder

    Returns:
        tf.data.Dataset: transformed dataset
    """

    # removed by mentor

    return dataset
```

This one seems to be working, but the first part that creates the dataset is not in one line. I really appreciate your help.

  • Houyao

Please don’t post code here :ok_woman:t2:

it is still incorrect. read my previous comment carefully.

See the image I shared without code, which has only two code lines. You are supposed to replace only those None values.

Thank you. I thought I was replying by email. I get it now, thanks.
