Stanford
@CS153Systems '26, Session 3 (Full lecture)
The Future of Voice Systems with @matiii from
@ElevenLabs
00:00 Welcome and Intro
01:31 Origin Story on Discord
05:15 The Dubbing Problem
07:44 Pipeline and Early Pivot
12:38 Building the First Model
15:24 Compute Costs and Patents
17:34 Roadmap Through 2025
22:00 Cascaded vs Fused Agents
30:38 Collaboration Over Competition
35:05 Revenue Growth and Team Design
37:56 Predictable Deployment Engine
42:32 Voice Safety and Watermarking
44:27 Research Bottlenecks Personalization
46:24 Training Tradeoffs Cascade vs Fuse
48:20 Five Year Vision Platform
51:08 Impact Work ALS and Ukraine
54:40 China Distillation and Openness
59:24 Studios AI Voice Economics
01:03:04 On Device Models and Platform Gap
01:04:36 Enterprise Tooling and Wrap Up