How to evaluate LLMS for labeling use case

I am working on a use case to Label data by classifying text messages under 4 different labels. How can I evaluate the performance of the classification model?

Have you ever heard of training, validation and testing? If not I suggest you take the Deep Learning Specialization!

1 Like