Hi everyone! ![]()
I’m La Mema Parandy, and I’m exploring the Week 2 labs on fine-tuning LLMs for dialogue summarization. I’ve been learning about full fine-tuning, PEFT, LoRA, and soft prompt tuning, and I’m really curious about how others are applying these techniques.
Here are a few questions I’d love to discuss with you all:
-
Fine-Tuning Trade-offs: How do full fine-tuning and PEFT compare in your projects in terms of performance vs. computational resources?
-
Instruction & Multi-Task Fine-Tuning: How could these approaches be applied to real-world NLP tools like chatbots or summarization systems?
-
Evaluation Metrics: Which metrics (ROUGE, BLEU, HELM, GLUE/SuperGLUE) do you rely on most for judging summarization quality?
-
LoRA & Soft Prompt Tuning: Has anyone tried these methods yet? How did the performance improvements compare to resource savings?
-
Applications & Ethics: How can we apply fine-tuned models responsibly while minimizing bias?
I’d love to hear your experiences, thoughts, or even resources you’ve found helpful. Looking forward to learning from you all!