AI-scribe evals should not stop at whether the note sounds right. The real failure is propagation: a wrong sentence becoming a med list, follow-up task, billing code, or research field. Score where the error can travel, who reviews it, and how it gets reversed.
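One way to make that concrete is a rough sketch in Python that scores each note error by its reach rather than by how the sentence reads. Everything named here is an assumption for illustration: the destination labels, the weights, the halving factors for review and reversibility, and the `NoteError` and `propagation_score` names are hypothetical, not a published rubric.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical destinations with illustrative weights (assumed, not validated).
DESTINATION_WEIGHT = {
    "note_only": 1,
    "follow_up_task": 2,
    "billing_code": 3,
    "research_field": 3,
    "med_list": 4,
}


@dataclass(frozen=True)
class NoteError:
    description: str
    destinations: Tuple[str, ...]  # where the wrong sentence can travel
    human_review: bool             # is it reviewed before it lands downstream?
    reversible: bool               # can the downstream record be corrected later?


def propagation_score(err: NoteError) -> float:
    """Higher means the error travels further with fewer chances to catch it."""
    reach = sum(DESTINATION_WEIGHT[d] for d in err.destinations)
    # Assumed discounts: review and reversibility each halve the risk.
    review_factor = 0.5 if err.human_review else 1.0
    reversal_factor = 0.5 if err.reversible else 1.0
    return reach * review_factor * reversal_factor


contained = NoteError("wrong laterality in narrative", ("note_only",), True, True)
travelling = NoteError("wrong dose", ("med_list", "follow_up_task"), False, False)

print(propagation_score(contained))   # 0.25
print(propagation_score(travelling))  # 6.0
```

The specific numbers matter less than the shape: two errors that sound equally wrong in the note can land far apart once destination, review, and reversal are scored.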