OpenAI admits it screwed up testing its ‘sychophant-y’ ChatGPT update

May 5, 2025
2,470 Views

Last week, OpenAI pulled a GPT-4o update that made ChatGPT “overly flattering or agreeable” — and now it has explained what exactly went wrong. In a blog post published on Friday, OpenAI said its efforts to “better incorporate user feedback, memory, and fresher data” could have partly led to “tipping the scales on sycophancy.”

In these updates, OpenAI had begun using data from the thumbs-up and thumbs-down buttons in ChatGPT as an “additional reward signal.” However, OpenAI said, this may have “weakened the influence of our primary reward signal, which had been holding sycophancy in check.” The company notes that user feedback “can sometimes favor more agreeable responses,” likely exacerbating the chatbot’s overly agreeable statements. The company said memory can amplify sycophancy as well.

OpenAI says one of the “key issues” with the launch stems from its testing process. Though the model’s offline evaluations and A/B testing had positive results, some expert testers suggested that the update made the chatbot seem “slightly off.” Despite this, OpenAI moved forward with the update anyway.

“Looking back, the qualitative assessments were hinting at something important, and we should’ve paid closer attention,” the company writes. “They were picking up on a blind spot in our other evals and metrics. Our offline evals weren’t broad or deep enough to catch sycophantic behavior… and our A/B tests didn’t have the right signals to show how the model was performing on that front with enough detail.”

Going forward, OpenAI says it’s going to “formally consider behavioral issues” as having the potential to block launches, as well as create a new opt-in alpha phase that will allow users to give OpenAI direct feedback before a wider rollout. OpenAI also plans to ensure users are aware of the changes it’s making to ChatGPT, even if the update is a small one.

Source link

You may be interested

Epic’s Mega sale has big discounts on games like GTA V, Red Dead Redemption, and Cyberpunk 2077
Technology
shares2,767 views
Technology
shares2,767 views

Epic’s Mega sale has big discounts on games like GTA V, Red Dead Redemption, and Cyberpunk 2077

new admin - May 16, 2025

It’s a great time to catch up on some big games you might’ve missed out on recently. Epic Games is…

11 inmates escape New Orleans jail, considered “armed and dangerous”
Top Stories
shares3,550 views
Top Stories
shares3,550 views

11 inmates escape New Orleans jail, considered “armed and dangerous”

new admin - May 16, 2025

How common are prison escapes? How common are prison escapes? 03:29 Eleven inmates considered "armed and dangerous" escaped a New…

Foo Fighters, Drummer Josh Freese Part Ways
Music
shares2,532 views
Music
shares2,532 views

Foo Fighters, Drummer Josh Freese Part Ways

new admin - May 16, 2025

[ad_1] "I’ve never been let go from a band, so while I’m not angry — just a bit shocked and…