(6/n) We apply our framework across 7 models, 🧩 5 diverse reasoning-intensive datasets (math, science, law, multi-step soft reasoning), and various 🧪 prompt interventions, finding that faithful confidence expression remains a significant challenge for LRMs.