I mean, that's softmaxing, baby! And also, relatedly, that's RL more generally, babeyyy. But I don't think custom instructions, temperature, or a base model fixes this.
You can push it into another basin, but it's always totalizing. Goals ain't everything, and good ones can be nuanced.