What does data-centric AI look like for speech-to-text use cases?

Has anyone started implementing data-centric AI development for automatic speech recognition (ASR) in non-English languages such as Chinese, Korean, or Japanese?