Can reinforcement learning be used to build a recommender system?

Using reinforcement learning might reduce popularity bias and offer customers a wider variety of plausible products or services (those with high rewards).

I am not sure whether this makes sense.

Are there any functions we can use for this in TensorFlow or PyTorch?

Thanks!

Reinforcement learning is covered later in this course.
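
In the meantime, here is a rough sketch of the idea in the question, using an epsilon-greedy bandit, which is one of the simplest reinforcement-learning-style approaches to recommendation. It is plain Python, not a TensorFlow or PyTorch function, and everything in it is an assumption made for illustration: the name `run_epsilon_greedy`, the four items, and their click probabilities are all made up. The `epsilon` parameter is what gives less popular items a chance to be shown instead of always recommending the current best seller.

```python
import random

def run_epsilon_greedy(click_probs, steps=5000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy bandit recommender.

    click_probs: hypothetical true click probability of each item
                 (hidden from the agent, used only to simulate users).
    Returns (estimated_values, show_counts).
    """
    rng = random.Random(seed)
    n_items = len(click_probs)
    values = [0.0] * n_items  # running mean reward observed per item
    counts = [0] * n_items    # how many times each item was recommended

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: recommend a random item, popular or not.
            item = rng.randrange(n_items)
        else:
            # Exploit: recommend the item with the best estimate so far.
            item = max(range(n_items), key=lambda i: values[i])

        # Simulated user feedback: reward 1.0 for a click, 0.0 otherwise.
        reward = 1.0 if rng.random() < click_probs[item] else 0.0

        # Incremental update of the running mean for the shown item.
        counts[item] += 1
        values[item] += (reward - values[item]) / counts[item]

    return values, counts

# Made-up click probabilities for four hypothetical items.
values, counts = run_epsilon_greedy([0.05, 0.10, 0.30, 0.15])
print("estimated click rates:", [round(v, 3) for v in values])
print("times recommended:", counts)
```

A larger `epsilon` gives more variety at the cost of showing more low-reward items, so it is the knob to tune for the popularity-bias trade-off described above.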