Our newest open experimental model delivers up to 4x faster inference on dedicated GPUs & opens door to exploring speed-critical, interactive local workflows.