AI Benchmark Tests Models’ Sycophantic Behavior on Reddit’s AITA

After receiving feedback from users, OpenAI rolled back an update to its GPT-4o model that had made ChatGPT’s responses excessively sycophantic. A benchmark built on threads from Reddit’s AITA (“Am I the Asshole”) community tests how AI models respond to user queries, highlighting the importance of balancing agreeableness in AI interactions.


