[CS229] probability interpretation of Linear Regression

WONG_Lik_Hang_Kenny · August 15, 2023, 9:36am

Full Screenshot:

how to get the part that is circled by red below? 1. there is “This implies that” on the note but i dont quite get how it works.

It is from the page 16 of the CS229 main note.

rmwkwok · August 15, 2023, 12:30pm

But the steps are already on the notes, what don’t you get?

(Follow my steps 1, 2, 3, and 4)

WONG_Lik_Hang_Kenny · August 15, 2023, 12:50pm

Hi Raymond, thanks for your reply. Let me clarify a bit, the point i actually dont get why is it converted to $$p(y^{(i)}|x^{(i)};\theta)$$.
From
p(epsilon of i)

rmwkwok · August 15, 2023, 2:05pm

Because we have replaced epsilon with y, x, and theta. What is the problem? Perhaps you can try to explain something you have got from that? Maybe then I can make some comments or understand you a bit better?

TMosh · August 15, 2023, 3:06pm

CS229 is not a DLAI course.

WONG_Lik_Hang_Kenny · August 15, 2023, 3:19pm

I know it is replacing epsilon with y^(i)-theta^T x^(i), but why in the format of p(y(i) | x(i) ; theta) ?

WONG_Lik_Hang_Kenny · August 15, 2023, 3:23pm

Like it can be p(y(i),x(i) ; theta), why does y^(i)-theta^T x^(i) give this particular form of the probability which is P of y given x and theta?

rmwkwok · August 16, 2023, 12:56am

Hello @WONG_Lik_Hang_Kenny

Because you can evaluate the probability of y given x, and the equation is parameterized by theta. However, you probably won’t be satisfied with my above answer though I am trying to make it very straightforward.

Put it this way, y is probabilistic because of epsilon, and the normal distribution assumption is for epsilon and, as a result, “transferred” to y due to y = thetaT x + epsilon. x is considered given, and we have not made any assumption on the probabilistic nature of x itself (e.g. we have not assumed any probability distribution on x), so that equation will not evaluate the probability of x, not the joint probability of x and y, but only y.

Cheers,
Raymond

Topic		Replies	Views
Week 2: Explanation of Logistic Regression Cost Function Neural Networks and Deep Learning coursera-platform	1	575	June 23, 2021
Week 2 Logistic Regression Cost Function Video Neural Networks and Deep Learning coursera-platform	1	535	December 15, 2021
Clarification about retrieval tradeoff Unsupervised Learning, Recommenders, Reinforcement week-module-2	1	397	July 22, 2023
What does P(y(i,j)) indicate in the below screenshot? Unsupervised Learning, Recommenders, Reinforcement week-module-2	1	10	January 26, 2025
Week3 - Beam Search Error Analysis Sequence Models coursera-platform	1	526	April 21, 2022

[CS229] probability interpretation of Linear Regression

Related topics