Further down it will be explained why so, its basically because the starting features reduce the entropy more that the ones that were not choose at first.