The book is finally out: The Apple Neural Engine Inference Book
A practitioner's guide, complete with converters, Swift runtimes, and validated model manifests.
Every model in this repo runs 100% on the Neural Engine (verified with MLComputePlan). No GPU fallback. No CPU matmuls.
Link in comments