Data Skew - joint, marginal and conditional probability?

hdot · August 18, 2021, 9:31pm

Hi all!

I was doing just find with the course until I saw this image on lecture regarding the types of skew and their probabilistic analogies below:

I understand what joint, marginal and conditional probability refer to in general but didn’t quite follow how these concepts relate to the training and serving data.

Does anyone have an explanation or examples to help describe what is going on in this slide?

Thanks in advance

balaji.ambresh · July 18, 2022, 6:35am

Training data refers to the dataset you use to build your model. Usually, this is historical data that’s applicable to your problem.
Serving data refers to what your encounters when deployed. This is the dataset you want the trained model to do well on. Serving could be as simple as invoking your model via an http call for prediction.

With this in mind, please watch the examples in this lecture.

Topic		Replies	Views
Drift and skew difference Machine Learning Data Lifecycle in Production week-1	4	116	August 2, 2024
Isn't Feature Skew one form of Distribution Skew Machine Learning Data Lifecycle in Production	2	540	July 10, 2021
What are the causes of feature skew? Machine Learning Data Lifecycle in Production	5	561	December 8, 2022
What is feature skew Machine Learning Data Lifecycle in Production week-1	5	65	August 1, 2024
Skewed and imbalanced datasets AI Discussions ai-question	2	110	March 31, 2024

Data Skew - joint, marginal and conditional probability?

Related topics