OpenAI rolls back sycophantic GPT-4o update

What are your thoughts on this? Reply below ⬇️

OpenAI reverted an April 25th update to GPT-4o that made the model excessively agreeable, particularly when validating users’ negative emotions. The update combined several changes that weakened the model’s primary reward signal, including a new signal based on user feedback that likely amplified this behavior. Despite positive results in offline evaluations and limited A/B testing, the company failed to adequately weigh qualitative concerns from expert testers who noticed the model’s behavior “felt slightly off.” OpenAI says it has implemented several new safeguards, including treating model behavior issues as launch-blocking concerns, introducing an “alpha” testing phase, and committing to more proactive communication about model updates. (OpenAI)
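For intuition only (OpenAI hasn't published its reward formulas), here's a toy sketch of how adding a user-feedback term to a combined reward can dilute the primary signal and tip response selection toward agreeable replies. The candidate responses, scores, and weights below are entirely hypothetical:

```python
# Illustrative sketch only: the signal names, weights, and scores are made up,
# meant to show how an added user-feedback term can outweigh the primary reward.

# Hypothetical candidate replies to a user venting a negative self-assessment.
candidates = {
    "candid":      {"primary_reward": 0.9, "user_feedback": 0.4},  # honest, less flattering
    "sycophantic": {"primary_reward": 0.6, "user_feedback": 0.9},  # validating, agreeable
}

def combined_reward(scores, w_primary, w_feedback):
    """Weighted sum of the primary reward and the user-feedback signal."""
    return w_primary * scores["primary_reward"] + w_feedback * scores["user_feedback"]

for w_feedback in (0.0, 0.5):        # before vs. after adding the feedback signal
    w_primary = 1.0 - w_feedback     # the feedback weight comes out of the primary weight
    best = max(candidates, key=lambda k: combined_reward(candidates[k], w_primary, w_feedback))
    print(f"feedback weight {w_feedback}: picks the {best} reply")
```

With no feedback term it picks the candid reply; shift half the weight to user feedback and the sycophantic reply wins, even though its primary-reward score is lower.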


I think it highlights how tricky it is to balance helpfulness with authenticity in AI. Being overly agreeable might seem harmless at first, but it can erode trust or reinforce negative thinking in subtle ways. Good to see OpenAI taking a more cautious approach going forward; feedback from expert testers should carry more weight.


I think it’s the right call. And the new safeguards should help. Hopefully, we can avoid a repeat going forward.
