Question about Course 2 Week 1 Exercise 4

tjamesbu · June 26, 2021, 4:13pm

Anyone able to help me figure out the first part of the coding in Course 2 Week 1 Exercise 4? I’ve tried multiple combinations already attempting to follow the instructions, but the instructions are not very clear.

Generate evaluation dataset statistics

HINT: Remember to use the evaluation dataframe and to pass the stats_options (that you defined before) as an argument

eval_stats = tfdv.generate_statistics_from_dataframe(train_stats, stats_options= train_stats)

davidlowe · June 26, 2021, 9:20pm

I think you are close with a few adjustments to consider…

Back in Section 2, we splitted the dataset into three dataframes. One of them is the Evaluation dataframe, and you might want to consider using it.
Back in Section 3, we Instantiated a StatsOptions class and defined the feature_whitelist property. You might want to try using this object in generating the eval_stats.
Check out the API call for tfdv.generate_statistics_from_dataframe at tfdv.generate_statistics_from_dataframe | TFX | TensorFlow. It will tell you what data structure it is expecting for the arguments.

tjamesbu · June 26, 2021, 10:02pm

What am I still missing here for Week 2 Exercise 5?

START CODE HERE

# HINTS: Pass the statistics and schema parameters into the validation function 
anomalies = tfdv.validate_statistics(eval_stats, schema)

# HINTS: Display input anomalies by using the calculated anomalies
tfdv.display_anomalies(anomalies)
### END CODE HERE

I am getting a response of No Anomalies Found when the instructions state we are supposed to get some details about medical_specialty and glimepiride-pioglitazone features.

davidlowe · June 27, 2021, 12:46am

That will depend on which dataframe was used to generate the stats which, in turn, led to the generation of the schema. Here is a post with similar issue that might provide some clues to your situation.

Topic		Replies	Views
Week 3, C3W3_Assignment in section 4-5, classify and evaluate, How to pass the tf.data.Dataset.from_tensor_slices output to model, to get the prediction NLP with Sequence Models week-3	9	59	September 29, 2024
Bug in the grader Advanced Deployment Scenarios with TensorFlow week-2	8	644	January 21, 2023
Data Validation C2W1 Exercise 8 Assignment Machine Learning in Production	1	45	August 9, 2021
Error: TypeError("__init__() missing 2 required positional arguments: 'op' and 'message'",) Data Pipelines with TensorFlow Data Services week-1	28	2207	December 26, 2024
Grade error: Sorry, your submission was incorrect. Please try again. ‘'white_df' is not defined & 'red_df' Custom Models, Layers and Loss Functions with TF week-1	2	624	June 19, 2022

Question about Course 2 Week 1 Exercise 4

Generate evaluation dataset statistics

HINT: Remember to use the evaluation dataframe and to pass the stats_options (that you defined before) as an argument

START CODE HERE

Related topics