first impression of claude 4.8: extremely convincing but still a slopus. asked it to critique a new project and it said the project had fallen into a local minimum by inventing a new parser when we could've used ast.
almost convinced me. glad i checked for myself: ast is not emitted by older versions of the compiler we are targeting. codex chose a gnarly but ultimately justified approach. claude didn't bother to verify any of its claims and used absolutist language like "delete analysis.py", which is basically 80% of the codebase.
when presented with evidence:
> That contradicts my earlier byte-count check, and it matters enormously
> My earlier "v0.2.9" was a double false-positive (a git log -S hit on an internal symbol, plus a verification grep that mis-read a VersionException as success). Corrected in the review with a note owning the error
the biggest bullshitter model in the world! if you rely on claude for anything, god help you.