Zooming out on what Chutes is actually building:
They are not trying to be the next closed frontier lab. They are trying to become the decentralized AI operating system. This is the infrastructure layer that lets anyone run and eventually train open models privately and cheaply at global scale.
Current state (shipping today):
• Strong serverless inference on open models
• Excellent privacy via TEE enclaves plus post-quantum encryption using ML-KEM-768
• Very good developer experience with custom subdomains, SDK, and one-command deploys
• Growing consumer surface including Search powered by Desearch (SN22), Chat, and Fictio
Next phase (clearly in progress):
• Long Jobs: currently Coming Soon for full training-scale workloads, but basic chute.job() background tasks with retries, progress tracking, and file handling are already live in the SDK
• Parallax: the decentralized Mixture-of-Experts training architecture that makes serious fine-tuning and eventually pretraining viable across fragmented GPUs
• Training as a Service: the long-term vision Jon Durbin has talked about
If they execute, Chutes becomes the default place where serious builders and researchers do private, cost-effective AI work. Inference today. Real training tomorrow.
That is a much larger total addressable market than just being another GPU rental marketplace.
10/11
Why This Actually Matters
Chutes is building the decentralized, privacy-first (TEE post-quantum), open-source alternative to centralized AI clouds.
Inference is already live and scaling. The real upside sits in Long Jobs Parallax; Turning decentralized GPUs into a credible training platform. If they execute, this becomes critical infrastructure, not just another GPU rental service.