Has anyone tried running AI models (CNNs/LLMs, ViTs/ Diffusion) on weird chips? Edge: Qualcomm AR1, Ambarella, TensTorrent Cloud: Trainium, Inferentia, AMD Or even just porting Ampere → Hopper → Blackwell? Curious: how painful was it? Did it kill your project before it started?