One of my personal favorite features announced at WWDC will I suspect be a sleeper hit: container machines, allowing your Mac to run a lightweight, persistent Linux environment with your home directory and repos automatically mounted: github.com/apple/container/b…
Dr. SHAP-AV has been accepted to Interspeech 2026 in the Long paper Track (acceptance rate ~29%)! More info (code,ckpts etc.) in the project website: umbertocappellazzo.github.io…
Look forward to presenting it in Sydney ( 2 regular papers I co-authored)🇦🇺🦘🐨
Second big release from us today: Nemotron-3.5-ASR-Streaming!
🌎40 languages
⚡️80ms - 1s controllable latency
🔥240 - 2400 concurrent streams on 1xH100
🧱FastConformer Cache-Aware RNN-T architecture
huggingface.co/nvidia/nemotr…
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors.
Available today at the same price.
ALT Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.
I'm attending #MLSys2026 for the full week! We'll be presenting two works.
1️⃣ At the main conference on Tuesday, we'll present TeleRAG, an inference acceleration technique for agentic RAG.
Paper: arxiv.org/abs/2502.20969
Code: github.com/uw-syfi/TeleRAG