Why do you divide the activations by keep_prob when you use drop

Hi, Ricky.

It looks like you already found this thread, which has a pretty complete discussion of this point. Did you also read the later replies on that thread? E.g. this one, this one and this one?