The fix came from how real teams work. Defend the thesis, update when the evidence is real, don't fold just because someone pushed.
Now the analyst has to accept, reject, or partially accept each critique, with reasoning.
The lesson nobody warned us about: LLMs are agreeable by default. Useful almost everywhere. Here it was the main failure mode. Getting research worth reading meant engineering disagreement into the system on purpose.