In the "Inception Network Motivation" video (week 2 of the CNN course), Andrew says the output of the pooling branch is (28, 28, 32) for an input of (28, 28, 192).
Timestamp: 2:10
I think pooling slides a single 2D window of size f over each channel independently, so the number of output channels should equal the number of input channels.
Andrew later clarified in the next video that the max pooling layer was followed by a conv layer with 32 filters.
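
To make the shapes concrete, here is a minimal sketch of the shape arithmetic in plain Python. The helper names `pool_output_shape` and `conv_output_shape` are made up for illustration, and the 3x3 pooling window, stride 1, "same" padding, and 1x1 conv kernel are assumptions on my part (the thread only states the 32 filters and the input/output shapes). It just shows that pooling leaves the channel count alone, while the following conv layer sets it to the number of filters.

```python
import math


def _spatial_out(size, f, stride, padding):
    """Output length along one spatial axis (TensorFlow-style conventions)."""
    if padding == "same":
        return math.ceil(size / stride)
    # "valid": no padding, window must fit entirely inside the input
    return (size - f) // stride + 1


def pool_output_shape(shape, f, stride, padding="same"):
    """Shape after max pooling. Each channel is pooled on its own,
    so the channel count passes through unchanged."""
    h, w, c = shape
    return (_spatial_out(h, f, stride, padding),
            _spatial_out(w, f, stride, padding),
            c)


def conv_output_shape(shape, f, n_filters, stride=1, padding="same"):
    """Shape after a conv layer. Each filter spans all input channels
    and yields one output channel, so channels == n_filters."""
    h, w, _ = shape
    return (_spatial_out(h, f, stride, padding),
            _spatial_out(w, f, stride, padding),
            n_filters)


x = (28, 28, 192)

# Assumed pooling settings: 3x3 window, stride 1, "same" padding.
pooled = pool_output_shape(x, f=3, stride=1, padding="same")

# Assumed 1x1 conv; the 32 filters come from the clarification above.
reduced = conv_output_shape(pooled, f=1, n_filters=32)

print(pooled)   # (28, 28, 192)
print(reduced)  # (28, 28, 32)
```

So the (28, 28, 32) in the slide would be the shape after the conv layer, not after the pooling itself.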