📣 Excited to finally share our latest work on quantifiably adapting model behavior to individual preferences 📣
We teach language models to adjust their clarification behavior using scalar coefficients and find they can generalize to unseen coefficients at inference time!
Newish work (posted to arXiv in Dec.):
Prompts can be ambiguous, but how to handle ambiguity depends on the context and the user. Sometimes the right thing is to ask a clarifying question, sometimes to give multiple answers, and sometimes to just guess. Can we train models to change their strategy per context?