Some sessions Opus 4.7 is an absolute genius.
Other sessions it is mentally handicapped.
The work is fundamentally the same, only difference is LLM variance & potentially Claude nerfing it to handle extreme load.
Still useful but very unreliable.
Claude Opus 4.7 has been really off lately.
I'm running it on xhigh effort with 1M context on the Max plan and it's just not performing.
Struggles with debugging.
Loses track of context mid session.
Takes multiple attempts on bugs that GPT 5.5 one shots.
I've been using Claude Code daily for months.
This model has felt like a miss from Anthropic.
Mythos has a 15% chance of releasing by June 30 on Polymarket.
Anthropic, we need it sooner.
Opus 4.7 is not holding up.