How is this training loss computed?

When creating the fully fine-tuned model, how is this loss computed? How is the model output compared with the labels? Is it calculated using a ROUGE or BLEU score, or something else?

It is cross-entropy loss, computed during the training phase. Have a look at this thread:

You can also learn how this is calculated by going through the NLP specialization offered by DLAI!
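
To make the cross-entropy answer a bit more concrete, here is a minimal sketch in plain Python of a token-level cross-entropy loss. It is not the course's or any library's actual code: the function name `token_cross_entropy`, the toy logits, and the use of `-100` as a "skip this position" label are all assumptions made for illustration.

```python
import math

# Assumption: positions labelled -100 (e.g. padding) are skipped.
IGNORE_INDEX = -100


def token_cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean negative log-likelihood of the reference tokens.

    logits: list of per-position score lists (one score per vocabulary entry)
    labels: list of reference token ids, one per position
    """
    total, count = 0.0, 0
    for scores, label in zip(logits, labels):
        if label == ignore_index:
            continue  # masked positions do not contribute to the loss
        # Numerically stable log-sum-exp over the vocabulary scores
        m = max(scores)
        log_z = m + math.log(sum(math.exp(s - m) for s in scores))
        # -log softmax(scores)[label]
        total += log_z - scores[label]
        count += 1
    return total / count if count else 0.0


# Hypothetical toy example: 3 positions, vocabulary of 3 tokens
toy_logits = [[2.0, 0.5, 0.1], [0.2, 0.1, 3.0], [1.0, 1.0, 1.0]]
toy_labels = [0, 2, IGNORE_INDEX]
loss = token_cross_entropy(toy_logits, toy_labels)
print(f"training loss: {loss:.4f}")
```

The loss compares the model's predicted distribution at each position with the label token at that position. ROUGE and BLEU, by contrast, are scored on generated text and are not differentiable, so they are typically used for evaluation rather than as the training loss.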

