Doshi-Velez & Kim (Towards a rigorous science of interpretable ML) were doing definitional and ðð¯ðð¥ð®ððð¢ðšð§ ð«ð¢ð ðšð«. Their paper has almost no math.
rather, it's a taxonomy (application-grounded, human-grounded, functionally-grounded evaluation) plus an argument that interpretability is not one thing and is fundamentally task- and human-dependent. It was a "stop being sloppy about what we're measuring" intervention. It tells you how to judge a method, not how to build one.
Standard Interpretable Model or SIM (Barbiero et al, 2026) is going for ð ðð§ðð«ððð¢ð¯ð ð«ð¢ð ðšð«. It's not just asking for cleaner evaluation, it's proposing a template that takes premises (e.g. user subjectivity) in and derives symmetries, constraints, losses, and architectures out.
Do you want to conduct interpretability research from first principles?
The Standard Interpretable Model is finally here:
A user-aware general theory of interpretable machine learning to deductively design interpretable methods
arxiv.org/abs/2606.12289