Build Automatic Speech Recognition with DL

Hi everybody,
Which architecture should I use to build an ASR system: LSTM or GRU? If I create a dataset of audio and transcripts for a low-resource language, what is the best approach?

The Deep Learning Specialization includes a lab on speech recognition. I think it would be helpful to you!
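
On the LSTM vs GRU part of the question, one concrete difference you can check yourself is layer size. Here is a rough sketch in plain Python that counts trainable parameters for a single recurrent layer, assuming the textbook formulation with one bias vector per gate (some libraries keep two, so their totals will be a bit higher). The function names and the feature/hidden sizes are just illustrative values I picked, not anything from the lab.

```python
def rnn_param_count(input_size: int, hidden_size: int, num_gates: int) -> int:
    """Parameters in one recurrent layer, assuming one bias vector per gate."""
    # Each gate has input weights, recurrent weights, and a bias.
    per_gate = hidden_size * input_size + hidden_size * hidden_size + hidden_size
    return num_gates * per_gate


def lstm_params(input_size: int, hidden_size: int) -> int:
    # LSTM: input, forget, and output gates plus the candidate cell state.
    return rnn_param_count(input_size, hidden_size, num_gates=4)


def gru_params(input_size: int, hidden_size: int) -> int:
    # GRU: update and reset gates plus the candidate hidden state.
    return rnn_param_count(input_size, hidden_size, num_gates=3)


if __name__ == "__main__":
    # Hypothetical sizes: 80 acoustic features per frame, 256 hidden units.
    n_features, n_hidden = 80, 256
    print("LSTM layer:", lstm_params(n_features, n_hidden))
    print("GRU layer: ", gru_params(n_features, n_hidden))
```

Under these assumptions a GRU layer has three quarters of the parameters of an LSTM layer of the same width. That may matter when the dataset is small, but it does not settle which one transcribes better, so it is probably worth training both on your data and comparing.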