Here’s another thread that discusses the question of why you don’t just reduce the size of the network to eliminate overfitting instead of applying dropout.
Here’s another thread that discusses the question of why you don’t just reduce the size of the network to eliminate overfitting instead of applying dropout.