Here on the notebook, it says “first we will remove teh binary variables, because one hot encoding them would do nothing to them”. But the binary variable Sex is still included here. Is this a mistake?
If it is a mistake, how would you code the variable Sex instead?
I thought the parameter in the function get_dummies should be the columns that will be converted to one-hot encoding, which means only categorical columns that contain more than 2 values. Why the full column names are provided?