Fine-tuning with RLHF

Can an encoder-decoder (seq2seq) model be fine-tuned with RLHF?

If so, is there any source code available for fine-tuning an encoder-decoder (seq2seq) model with RLHF?