Adding Training data which distribution differs from Dev/Test sets

We build neural networks to model the real-world data distribution. If we already know that distribution, we donโ€™t need neural networks. In other words, if we knew everything that may happen, there would be nothing to predict :slightly_smiling_face:

Evaluation of generative models quite a problem itself :slight_smile: