Collaborative filtering is getting complicated

How does the last formula work if all three are unknown? Or does the algorithm find the features and the weights that fit those features at the same time?

y^{(i,j)} is known. Given this known information, the algorithm, as you said, finds the w, x, and b that together minimize the cost.
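To make the "all at once" part concrete, here is a minimal NumPy sketch of joint gradient descent. It is only an illustration under my own assumptions, not the course's implementation: the function names, the rated-mask matrix R, the learning rate, the regularization strength, and the toy ratings are all made up. It assumes the prediction for user j on item i is w^{(j)} · x^{(i)} + b^{(j)} and that the cost is the squared error over the known y^{(i,j)} plus L2 regularization on w and x.

```python
import numpy as np


def cost(Y, R, X, W, b, lam):
    """Squared error over rated entries plus L2 penalty on X and W."""
    err = (X @ W.T + b - Y) * R  # zero wherever no rating exists
    return 0.5 * np.sum(err ** 2) + 0.5 * lam * (np.sum(X ** 2) + np.sum(W ** 2))


def fit(Y, R, num_features=2, lr=0.01, lam=0.1, steps=2000, seed=0):
    """Learn item features X, user weights W and user biases b together.

    Y: (num_items, num_users) ratings, only meaningful where R == 1.
    R: same shape as Y, 1 where a rating is known, 0 otherwise.
    """
    rng = np.random.default_rng(seed)
    num_items, num_users = Y.shape
    # All three start unknown: small random X and W, zero b.
    X = rng.normal(scale=0.1, size=(num_items, num_features))
    W = rng.normal(scale=0.1, size=(num_users, num_features))
    b = np.zeros(num_users)

    for _ in range(steps):
        err = (X @ W.T + b - Y) * R      # only known ratings contribute
        grad_X = err @ W + lam * X       # gradient w.r.t. item features
        grad_W = err.T @ X + lam * W     # gradient w.r.t. user weights
        grad_b = err.sum(axis=0)         # gradient w.r.t. user biases
        # Simultaneous update: X, W and b all move in the same step.
        X -= lr * grad_X
        W -= lr * grad_W
        b -= lr * grad_b
    return X, W, b


if __name__ == "__main__":
    # Hypothetical toy data: 4 items, 3 users, 0 in R means "not rated".
    Y = np.array([[5, 4, 0], [4, 0, 1], [0, 1, 5], [1, 0, 4]], dtype=float)
    R = np.array([[1, 1, 0], [1, 0, 1], [0, 1, 1], [1, 0, 1]], dtype=float)
    X, W, b = fit(Y, R)
    print("final cost:", cost(Y, R, X, W, b, lam=0.1))
```

The only thing pinning the unknowns down is the known ratings: every gradient is built from the error on entries where R is 1, so w, x, and b are each nudged in whatever direction reduces that error.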


