I am trying to build a database of Turkish words and the languages they originate from. I would like to use the data to train an RNN. I wrote a web scraping tool that searches a dictionary for a word and extracts the relevant data. However, I ran into a problem: the dictionary only gives the data I want for the root of a word. For instance, if the root of a word is an adjective and I search for the adverb derived from it, the dictionary does not show the same information. Is there an efficient way to find the root of a word, or a vector database solution to this problem?

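Have you heard of lemmatization? It reduces a word form to its dictionary form (the lemma), which is the form you would then look up.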
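To make the idea concrete, here is a minimal sketch of a root lookup by suffix stripping. It is not a real lemmatizer: the suffix list and the `find_root` helper are assumptions made up for illustration, `known_roots` stands in for the set of headwords your scraper has already found in the dictionary, and input is assumed to be lowercase already. Real Turkish morphology (vowel harmony, consonant changes such as kitap/kitabı, stacked suffixes) needs a proper morphological analyzer such as Zemberek.

```python
# Toy suffix list: plural -lar/-ler and the adverb-forming -ca/-ce/-ça/-çe.
# This is an illustrative assumption, nowhere near a full list.
SUFFIXES = ["lar", "ler", "ca", "ce", "ça", "çe"]


def find_root(word, known_roots, suffixes=SUFFIXES):
    """Strip suffixes from the end of `word` until a known root is left.

    Returns the root, or None if no known root is reached.
    """
    candidate = word
    # Try longer suffixes first so "ler" is preferred over a shorter match.
    ordered = sorted(suffixes, key=len, reverse=True)
    while True:
        if candidate in known_roots:
            return candidate
        for suffix in ordered:
            # Keep at least two characters so we never strip down to nothing.
            if candidate.endswith(suffix) and len(candidate) > len(suffix) + 1:
                candidate = candidate[: -len(suffix)]
                break
        else:
            # No suffix matched and the candidate is not a known root.
            return None


roots = {"güzel", "kitap", "ev"}
print(find_root("güzelce", roots))   # adverb derived from the adjective "güzel"
print(find_root("kitaplar", roots))  # plural of "kitap"
```

In your scraper you could first search for the word as typed and fall back to something like this only when the dictionary returns no origin data. The greedy stripping can pick the wrong root for some words, so treat its output as a candidate to check, not a final answer.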