Aligning LLMs with Direct Preference Optimization
