C3W2_Content-Based Filtering

maab · May 3, 2025, 9:38pm

In the C3_W2_RecSysNN_Assignment, the data was normalized before splitting into training and testing sets. Isn’t this a form of data leakage? Shouldn’t normalization be done after splitting, using only the training data?

rmwkwok · May 3, 2025, 11:51pm

Hello, @maab,

Welcome to this community!

Yes, I am with you on this that those scalars should be fitted with training data only, as the testing data is not supposed to be known at this stage. I will share your post with the course team for their review.

Cheers,
Raymond

maab · May 4, 2025, 4:19am

Thanks for the response!

Topic		Replies	Views
C3_W2_RecSysNN_Assignment how to download the data Unsupervised Learning, Recommenders, Reinforcement week-2	4	202	April 18, 2024
Collaborative Filtering - problem with implementation on raw dataset Unsupervised Learning, Recommenders, Reinforcement week-2	38	381	June 16, 2024
C3 Week 2 - Error in 'Deep Learning for Content-Based Filtering' Assignment Unsupervised Learning, Recommenders, Reinforcement week-2	3	583	October 19, 2022
C3_W2 - Practice Lab 1: Mean Normalization Unsupervised Learning, Recommenders, Reinforcement week-2	6	570	August 24, 2022
C3_W2_RecSysNN issue Unsupervised Learning, Recommenders, Reinforcement week-2	3	513	August 1, 2022

C3W2_Content-Based Filtering

Related topics