You are welcome @matrix. I have two pieces of hints.
This can help justify my eta0.
eta0
n_iter_no_change has to do with the fact that you are using SGD, so each iteration considers only one sample of your dataset.
n_iter_no_change
Cheers, Raymond