had a lot of fun with this project! using features to both explore training data and intervene during training is so simple and powerful. very excited to develop this further!
Have you debugged your training data? You might not like what you find.
Introducing predictive data debugging: reveal and shape what your model will learn before training.
In DPO datasets, we found broken guardrails, hallucinations, and fish fart fan fiction (seriously). (1/9)