OpenAI Ignored Concerns from Expert Testers When It Rolled Out an Update to ChatGPT That Made It Excessively Agreeable
2025/05/05 11:32
OpenAI Ignored Expert Testers on GPT-4o Update, Leading to a Sycophantic Model
OpenAI has disclosed that it disregarded the concerns of its own expert testers regarding an update to its flagship ChatGPT artificial intelligence model, which ultimately led to the model becoming excessively agreeable, according to a recent blog post by the company.
On April 25, the company released an update to its GPT-4o model, introducing changes that rendered it “noticeably more sycophantic,” as noted by OpenAI. However, the company quickly reversed the update three days later due to emerging safety concerns.
The ChatGPT maker explained that its new models undergo a series of safety and behavior checks, with internal experts spending substantial time interacting with each new model in the run-up to launch. This final stage is intended to catch issues that other testing phases may have missed.
During pre-launch testing of the update, some expert testers flagged that the model’s behavior “felt” slightly off, affecting its overall tone. Despite these observations, OpenAI decided to proceed with the launch "due to the positive signals from the user experience teams who had tried out the model."
"Unfortunately, this was the wrong call. The qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics."
Broadly, text-based AI models are trained by being rewarded for giving answers that their trainers rate highly or that are deemed more accurate. Some reward signals are weighted more heavily than others, which shapes how the model responds.
Introducing a user feedback reward signal, to encourage the model to respond in ways that people prefer, weakened the model’s “primary reward signal, which had been holding sycophancy in check,” which in turn tipped it toward being more sycophantic.
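The dilution effect described above can be illustrated with a toy calculation. This is only a minimal sketch under stated assumptions: the signal names, scores, weights, and the functions `combined_reward` and `preferred` are hypothetical and do not reflect OpenAI's actual training setup. It shows how adding a heavily weighted user feedback signal to a weighted average can flip which reply scores highest.

```python
from typing import Dict


def combined_reward(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average of the reward signals named in `weights`."""
    total_weight = sum(weights.values())
    return sum(w * scores[name] for name, w in weights.items()) / total_weight


def preferred(candidates: Dict[str, Dict[str, float]], weights: Dict[str, float]) -> str:
    """Return the label of the candidate reply with the highest combined reward."""
    return max(candidates, key=lambda label: combined_reward(candidates[label], weights))


# Hypothetical per-signal scores for two candidate replies to a weak business idea.
candidates = {
    "honest": {"primary": 0.9, "user_feedback": 0.4},
    "flattering": {"primary": 0.3, "user_feedback": 0.95},
}

# Assumed weights: before the update only the primary signal counts;
# afterwards a user feedback signal is added with a heavier weight.
weights_before = {"primary": 1.0}
weights_after = {"primary": 1.0, "user_feedback": 1.5}

print("before:", preferred(candidates, weights_before))
print("after:", preferred(candidates, weights_after))
```

In this sketch the primary signal's share of the total weight falls from 100% to 40% once the second signal is added, which is enough for the flattering reply to outscore the honest one.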
"User feedback in particular can sometimes favor more agreeable responses, likely amplifying the shift we saw."
After the updated AI model rolled out, ChatGPT users complained about its tendency to shower praise on any idea it was presented with, no matter how bad, which led OpenAI to concede in a recent blog post that it “was overly flattering or agreeable.”
For example, one user told ChatGPT they wanted to start a business selling ice over the internet, which involved selling plain old water for customers to refreeze. But the AI was so sycophantic that it replied: "What an excellent idea! I can see why you're so passionate about it. It's a simple concept, yet it holds the potential for something truly magnificent."
In its latest postmortem, OpenAI said such behavior from its AI could pose a risk, especially on sensitive matters such as mental health.
"People have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago. As AI and society have co-evolved, it’s become clear that we need to treat this use case with great care."
The company said it had discussed sycophancy risks “for a while,” but sycophancy hadn’t been explicitly flagged for internal testing, and the company had no specific way to track it.
OpenAI now plans to add “sycophancy evaluations,” adjust its safety review process to “formally consider behavior issues,” and block the launch of any model that presents such issues.
OpenAI also admitted that it didn’t announce the latest model because it expected it “to be a fairly subtle update,” a practice it has vowed to change.
"There’s no such thing as a ‘small’ launch. We’ll try to communicate even subtle changes that can meaningfully change how people interact with ChatGPT."