🚀Varying Shades of Wrong: When no correct answers exist, can alignment still unlock better outcome?
Introducing wrong-over-wrong alignment, where models learn to prefer "less-wrong" over "more-wrong". Surprisingly, aligning with wrong answers only can lead to correct solutions!