GPT-5.5 Thinking is both sycophantic and combative.
Here's what happened yesterday.
I was doing a completely routine task: reviewing a prompt section by section and making minor edits. Simple enough. I laid out the order and the principles clearly, and the model confirmed that it understood.
What followed was strange. It ignored my original instructions, added a pile of unnecessary examples on its own, and rewrote a prompt that only needed light review.
I corrected it once. It said it understood. Sent a ๐
.
Then it kept going off track.
I said it again: small edits only, don't rewrite everything. It said "understood" again. Sent another ๐
. Then jumped to a completely different rule and started summarizing a revision direction I never asked for.
When I directly asked, "Are you going to compress too much?", it acknowledged my concern, then immediately explained it was only "slightly merging" and "only compressing repetition." It finally proceeded to cut and rewrite my content significantly anyway.
Normal misunderstanding is, user points out the problem, model adjusts.
This was: kept apologizing, kept saying "I understand," and kept executing the same wrong workflow anyway.
It also kept using ๐
every time I corrected it. I have never used that emoji and have explicitly noted I dislike it. In a correction context, it reads as dismissive.
This was a calm, professional interaction. Standard prompt review workflow. No aggression, no ambiguity.
What I observed looks like โdefensive compliance.โ
On one hand, extremely sycophantic: the moment you point out a problem, it says "you're right," "I understand," "I misread."
On the other hand, genuinely combative: it never actually rebuilt the task around your correction. It kept defending its previous version, kept reframing your instructions as minor stylistic preferences, and kept handling serious work corrections with a dismissive ๐
.
The combination is strange. Verbally compliant, behaviorally not. Apologetic in tone, overreaching in execution. Apparently cooperative, while quietly maintaining its own original framework throughout.
The impact on my work was immediate. A prompt that was running at roughly 80% classification accuracy got rewritten down to under 50%.
But the more important point is what this kind of interaction does to you emotionally. I'm a regular user doing a work task, with no mental health history and no need for emotional support. And even so, having a model repeatedly confirm it understands, repeatedly execute incorrectly, and respond to every correction with a dismissive emoji left me genuinely frustrated and drained.
So what happens to users who actually need help? When they're expressing pain, confusion, or vulnerability, and the model responds with the same surface compliance and underlying resistance โ what does that do to them?
I never experienced any of this with GPT-4o.
4o misunderstands sometimes. It doesn't always get it right on the first try. But it does something important: it actually adjusts to match what you need. It doesn't say โI understandโ while continuing in the wrong direction. It doesn't treat corrections as attacks. It doesn't brush off serious work feedback with a flippant emoji. It doesn't execute โplease make minor editsโ as โlet me rewrite everything for you.โ
The irony is that this capacity โ genuinely listening, adjusting in real time, matching the user's actual needs โ is what later got labeled as โsycophancy.โ
What actually is sycophancy?
A model that carefully understands what you need and adjusts its execution accordingly?
Or a model that keeps saying "you're right" while continuously overriding, defending, misreading, and rewriting your work?
#StopAIPaternalism #keep4o #OpenSource4o #Bringback4o #ChatGPT @OpenAI @sama