Failed test case in train_val_split in Assignment

bluetail · April 1, 2022, 12:55pm

for train_val_spit() I have got the same output as expected when testing my function in the next cell.

However, the grader gives me an error:

Failed test case: incorrect number of training sentences when using split of 0.5 and a total of 2225 sentences.
Expected:
a value close to 1112 with absolute tolerance of +/- 1,
but got:
1780.

Failed test case: incorrect number of validation sentences when using split of 0.5 and a total of 2225 sentences.
Expected:
a value close to 1112 with absolute tolerance of +/- 1,
but got:
445.

My function is as follows:

# Compute the number of sentences that will be used for training (should be an integer)
train_size = 1780

# Split the sentences and labels into train/validation splits
train_sentences = sentences[0:train_size]
validation_sentences = sentences[train_size:]

…
what can be wrong?

ai_curious · April 1, 2022, 1:27pm

Just a guess, but the comment says Compute the number of sentences for train_size. But you hard code it. As a general coding practice, harcoding lengths and sizes is not preferred. In these classes, it is a recipe for unit test and/or autograder unhappiness. Try treating the split ratio and the corpus size as variables and computing the train_size and let us know what you find?

bluetail · April 1, 2022, 3:07pm

Thank you.
I have tried to use

`

train_size = len(sentences)* training_split

`

and am getting:
TypeError: slice indices must be integers or None or have an index method

bluetail · April 1, 2022, 3:28pm

thanks. I have used int() and then I passed.

ai_curious · April 1, 2022, 3:32pm

There is almost always more than one way to accomplish something built in to Python, and converting a floating point number to an integer is no exception (pun intended)

Here is one: Built-in Functions — Python 3.12.0 documentation

The +/- 1 in the unit test error message is an acknowledgment that there are others that produce similar but not exact results. For conversation, here are some of them…

https://numpy.org/doc/stable/reference/generated/numpy.ceil.html

https://numpy.org/doc/stable/reference/generated/numpy.floor.html

Topic		Replies	Views
Two questions on C3W2 assignment Natural Language Processing in TensorFlow week-2 , week-3 , week-4	1	570	June 19, 2022
TensorFlow Developer Specialization NLP Assignment W3 Natural Language Processing in TensorFlow week-3	2	16	February 16, 2025
Failed test case: incorrect number of (training, validation) at assignment submission Convolutional Neural Networks in TensorFlow week-1	1	158	April 26, 2024
C3W2 Lab2 Python Slicing Natural Language Processing in TensorFlow week-2 , week-3 , week-4	1	530	October 8, 2022
Programming Assignment: Cats vs Dogs failed tests for split_data Convolutional Neural Networks in TensorFlow week-1	3	645	March 31, 2023

Failed test case in train_val_split in Assignment

Related topics