Assignment 1 understanding

Mugheera_Saleem · April 27, 2023, 10:35am

Hi! I am going through the 1st assignment of this course. I understand the part about the differences in schema of the two datasets ie training and serving. But I don’t understand how are we removing the anomalies from the dataset? What are we doing by getting the domain of some features? Please if someone can make me understand what are we trying to achieve in the part 6 of the assignment (Schema Environment) it would be great, thanks.

Isaak_Kamau · April 28, 2023, 7:04am

Hello @Mugheera_Saleem
Which is the exact assignment?
In most cases, you are defining the schema of your data and applying it to both the training and serving datasets. This involves identifying the expected format and data types of each feature, as well as any allowable ranges or domains. By doing this, you can ensure that the datasets are consistent and compatible with the machine learning model you will be training and deploying.

About getting the domain of some features mostly means identifying the range of values that a particular feature can take on. For example, if you have a feature that represents age, the domain of that feature would be the range of possible ages (e.g. 0-100 years). By getting the domain of the features, you can identify any anomalies that fall outside of this range and remove them from the dataset.

Mugheera_Saleem · April 29, 2023, 5:57am

Course 2 of MLEP, week-1 assignment.

Topic		Replies	Views
Week 1 Assignment Machine Learning Data Lifecycle in Production	1	553	July 20, 2022
What is the association between domain and features in ML and pandas dataframe? Machine Learning Data Lifecycle in Production	2	539	June 11, 2021
C2W1_Assignment Extercise 9 Machine Learning Data Lifecycle in Production	1	542	September 6, 2021
Problem in Week 1 Assignment Machine Learning Data Lifecycle in Production	2	570	September 8, 2022
C2W1_Assignment Exercise_7 Machine Learning Data Lifecycle in Production week-1	6	493	December 24, 2023

Assignment 1 understanding

Related topics