OpenAI admits it screwed up testing its ‘sycophant-y’ ChatGPT update

May 5, 2025

Last week, OpenAI pulled a GPT-4o update that made ChatGPT “overly flattering or agreeable” — and now it has explained what exactly went wrong. In a blog post published on Friday, OpenAI said its efforts to “better incorporate user feedback, memory, and fresher data” could have partly led to “tipping the scales on sycophancy.”

In these updates, OpenAI had begun using data from the thumbs-up and thumbs-down buttons in ChatGPT as an “additional reward signal.” However, OpenAI said, this may have “weakened the influence of our primary reward signal, which had been holding sycophancy in check.” The company notes that user feedback “can sometimes favor more agreeable responses,” likely exacerbating the chatbot’s overly agreeable statements. The company said memory can amplify sycophancy as well.

OpenAI says one of the “key issues” with the launch stems from its testing process. Though the model’s offline evaluations and A/B testing had positive results, some expert testers suggested that the update made the chatbot seem “slightly off.” OpenAI moved forward with the update anyway.

“Looking back, the qualitative assessments were hinting at something important, and we should’ve paid closer attention,” the company writes. “They were picking up on a blind spot in our other evals and metrics. Our offline evals weren’t broad or deep enough to catch sycophantic behavior… and our A/B tests didn’t have the right signals to show how the model was performing on that front with enough detail.”

Going forward, OpenAI says it will “formally consider behavioral issues” as having the potential to block launches, and it will create a new opt-in alpha phase that lets users give the company direct feedback before a wider rollout. OpenAI also plans to ensure users are aware of the changes it’s making to ChatGPT, even if the update is a small one.
