If the first term (before the regularization term) of each cost function is the same, then when they are put together, why does the combined cost function still have 1/2 as the coefficient instead of 1?
If I understand the video on collaborative filtering correctly, the logic of the final cost function is: the first cost function uses known x and y to learn w and b, and the second uses known w, b, and y to learn x. Since the first term (before the regularization term) is the same in both cost functions, adding them together amounts to using known y to learn w, b, and x. Is my understanding correct?
Hello @flyunicorn, we don’t add them together. We put them together by combining a single error term with the two regularization terms.
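In the course's notation (sketching from memory, so treat the indices as approximate), the combined cost has one squared-error term with a single 1/2, plus one regularization term per set of trainable parameters:

```latex
J(x, w, b) =
\frac{1}{2} \sum_{(i,j):\, r(i,j)=1}
  \left( w^{(j)} \cdot x^{(i)} + b^{(j)} - y^{(i,j)} \right)^2
+ \frac{\lambda}{2} \sum_{j=1}^{n_u} \sum_{k=1}^{n} \left( w_k^{(j)} \right)^2
+ \frac{\lambda}{2} \sum_{i=1}^{n_m} \sum_{k=1}^{n} \left( x_k^{(i)} \right)^2
```

Notice the error term appears once, so its 1/2 coefficient is unchanged; only the regularization terms are duplicated, one per parameter set.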
The 3 cost functions correspond to three separate cases:
the top one assumes only x and y are given as data
the middle one assumes only w, b, and y are given as data
the bottom one assumes only y is given as data
Obviously the three cases can’t co-exist, so when you think about the bottom one, you can’t assume the first two hold. In other words, the bottom one is not a sum of the first two. They are separate cases, and we form each of them by summing the error term and the regularization terms (applied to all trainable parameters).
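As a sketch of "summing the error and regularization over all trainable parameters," here is a minimal NumPy version of the bottom cost function. The function and variable names are my own for illustration, not from the course:

```python
import numpy as np

def collab_cost(X, W, b, Y, R, lam):
    """Combined collaborative-filtering cost (illustrative sketch).

    X : (num_movies, num_features) movie feature vectors
    W : (num_users, num_features)  per-user weight vectors
    b : (1, num_users)             per-user biases
    Y : (num_movies, num_users)    ratings matrix
    R : (num_movies, num_users)    indicator, 1 where a rating exists
    lam : regularization strength lambda
    """
    # Squared error counted only over the rated entries (r(i,j) = 1)
    err = (X @ W.T + b - Y) * R
    # One 1/2 on the single error term; one regularization term
    # per set of trainable parameters (W and X; b is not regularized)
    return 0.5 * np.sum(err ** 2) + (lam / 2) * (np.sum(W ** 2) + np.sum(X ** 2))
```

Note the error term appears exactly once, which is why the 1/2 coefficient survives in the combined cost.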
I suggest you watch the video again to see that Andrew was only discussing them as individual cases.
Since the bottom one is closest to reality, where in most cases we only know some users’ ratings on some movies and don’t know w, b, or x, why do we need the other two cases?