Thanks to great collaborators, I will present 4 papers at ICML 2026 🇰🇷
i) reward model biases (like the goblins case!)
ii) real, though rare, cases where CoT is misleading
iii) mech interp of confidence
iv) base models know how to reason, thinking models learn when ⭐
🧵