I'm running codex xhigh, and on a stop hook I'm sending the diff and some notes to claude on xhigh (not max). Between the two of them, it's shocking how many issues they find. Usually it takes 2 to 5 rounds back and forth to work it out. Last week I would have done gpt xhigh, some tests, boom, I'm done.
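
A rough sketch of that back-and-forth, for anyone who wants to try something similar. This is not the actual hook setup: `reviewer` and `fixer` are hypothetical callables standing in for whatever sends a prompt to the second model and whatever applies its feedback, and the `NO ISSUES` sentinel and the 5-round cap are assumptions chosen for illustration.

```python
from typing import Callable, List


def build_review_prompt(diff: str, notes: str) -> str:
    """Combine the diff and the author's notes into one review request."""
    return (
        "Review this change and list concrete issues, one per line.\n"
        "Reply with exactly 'NO ISSUES' if you find none.\n\n"
        f"Notes:\n{notes}\n\nDiff:\n{diff}\n"
    )


def review_loop(
    diff: str,
    notes: str,
    reviewer: Callable[[str], str],    # hypothetical: prompt -> reviewer reply
    fixer: Callable[[str, str], str],  # hypothetical: (diff, reply) -> new diff
    max_rounds: int = 5,               # assumed cap on rounds
) -> List[str]:
    """Alternate review and fix until the reviewer is satisfied or the cap hits.

    Returns the reviewer's replies, one per round.
    """
    history: List[str] = []
    for _ in range(max_rounds):
        reply = reviewer(build_review_prompt(diff, notes)).strip()
        history.append(reply)
        if reply == "NO ISSUES":
            break
        # Hand the feedback to the first model and review its revised diff next.
        diff = fixer(diff, reply)
    return history
```

In a real setup the two callables would wrap the model calls, and the stop hook would supply the diff (for example, the output of `git diff`) plus the notes.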