Hi, my question is from q11 in the 2nd assignment, .
After working further on the problem, you’ve decided to correct the incorrectly labeled data on the dev set. Which of these statements do you agree with? (Check all that apply).
My question is
- How does fixing incorrectly label data in the training set change its distribution?
- Why is it the DL algorithms are robust to having slightly different train and dev distributions?