C3_W1 Exercise 5

Joao_Carlos_Lima_Sel · June 19, 2023, 2:44am

Hi guys, I recommended a discussion above, a topic created by DebbieA, that you should read carefully. There I explained most of Ex. 5 in detail.

I’ll try to add a complement about data frames and the first dict part, which I noticed is the core of your problems.

Let’s understand the “df_breed = None” part. You want to pick a slice of the data frame: the rows that match the current breed (remember you’re looping over the breeds [0, 1, 2]), but only the columns with the features. So you can add a condition that selects the rows of a specific breed_i this way:

df_breed = df[df["breed"] == breed_i][features]
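
To see what that line does in isolation, here is a minimal sketch on a toy data frame. The feature names "height" and "weight" and all the values are made up for illustration, and I am assuming the label column is called "breed"; the assignment's real data will differ.

```python
import pandas as pd

# Toy data: column names and values are hypothetical, only for illustration.
df = pd.DataFrame({
    "height": [30.0, 32.0, 50.0, 52.0, 70.0],
    "weight": [10.0, 11.0, 25.0, 26.0, 40.0],
    "breed": [0, 0, 1, 1, 2],
})
features = ["height", "weight"]
breed_i = 0

# Boolean Series: True on the rows whose breed equals breed_i.
mask = df["breed"] == breed_i

# Keep the matching rows first, then only the feature columns.
df_breed = df[mask][features]

# The same selection written in one step with .loc.
df_breed_loc = df.loc[mask, features]

print(df_breed)
```

Both forms should give the same slice; the .loc version just does the row and column selection in a single call.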

P.S. I used “breed_i”, but you must match the iterator of the FOR LOOP (the loop over the breeds) you coded in the previous step, which can be x, n, breed, etc.

Now you’ll calculate the proportion of each breed_i. The slice you picked above, “df_breed”, has only the rows of one breed. The data frame “df” has the rows of all breeds. So just divide the number of rows of “df_breed” by the number of rows of “df”. In Python you can do this count for each breed:

probs_dict[breed_i] = len(df_breed) / len(df)
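
Putting the two lines together inside the loop could look roughly like the sketch below. It is not the graded solution, just the idea on toy data: the "height" column and the values are hypothetical, and the last lines are only a sanity check using pandas' value_counts.

```python
import pandas as pd

# Toy data: 2 rows of breed 0, 2 rows of breed 1, 1 row of breed 2 (made-up values).
df = pd.DataFrame({
    "height": [30.0, 32.0, 50.0, 52.0, 70.0],
    "breed": [0, 0, 1, 1, 2],
})
features = ["height"]

probs_dict = {}
for breed_i in [0, 1, 2]:
    # Rows of the current breed, feature columns only.
    df_breed = df[df["breed"] == breed_i][features]
    # Proportion of this breed = rows of this breed / rows of the whole data frame.
    probs_dict[breed_i] = len(df_breed) / len(df)

print(probs_dict)

# Sanity check: pandas can compute the same proportions directly.
check = df["breed"].value_counts(normalize=True).to_dict()
```

The proportions over all breeds should add up to 1, which is a quick way to catch a wrong denominator.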

To make it more elegant, you can round the result as suggested by the instructions. Just use “round(len(df_breed) / len(df), 3)”. The 3 is the number of decimal places.

Check the rest of the hints I posted in the recommended discussion.
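
A tiny example of what the rounding does, with made-up counts (1 row of a breed out of 3 rows in total):

```python
# Hypothetical counts, only to show the effect of round(..., 3).
n_breed, n_total = 1, 3

raw = n_breed / n_total      # 0.3333333333333333
rounded = round(raw, 3)      # keeps 3 decimal places

print(raw, rounded)
```

Keep in mind that rounded proportions may no longer add up to exactly 1 (here three breeds at 0.333 each would give 0.999), which is normal.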
