Slow Inference Has Zero Market
Andrew Feldman, Co-founder and CEO, Cerebras, interviewed by
@saranormous and
@eladgil (No Priors)
Summary: Cerebras went public at $63 billion with a $20 billion-plus OpenAI backlog after spending eight years convincing the industry that GPUs were the wrong architecture for AI inference. Andrew Feldman argues that speed creates new business categories. Fast internet turned Netflix into a movie studio. Fast AI will do the same to workflows we now think of as fixed. The lesson for operators: a 20x speedup requires a different architecture, the chasm between technical proof and market demand can be a multi-year burn, and the companies winning right now treat their old "speed of light" assumptions as soft limits.
1. The Netflix Test. Speed creates new business categories, the way fast internet turned a DVD-by-mail company into a movie studio. Bandwidth opened up an entirely different business for Netflix. The market for slow inference will end up where dial-up and slow search ended up: at zero. Right now AI is still in the "deliver DVDs faster" phase of its potential.
2. Wafer-Scale Bet. A 15 to 20x speedup over GPUs requires a fundamentally different architecture. Cerebras built a 46,000 square millimeter chip the size of a dinner plate while every competitor was building chips the size of postage stamps. Gene Amdahl tried wafer-scale and failed; the rest of the industry called it impossible. The general principle: a 20x improvement requires a design that looks unlike anything that came before.
3. The $8m-per-month Years. From 2017 to mid-2019 Cerebras spent $8 million a month with no working chip and held a board meeting every six weeks to report "still not working." Each failure analysis got the team slightly closer. The chip yielded in summer 2019, and the founders sat staring at it for half an hour because no one had ever done it. That is what hardware conviction looks like measured in cash.
4. Two Years Too Early. Cerebras solved one of the hardest problems in the computer industry and then watched two years pass with almost no one caring. Gen 1 sold around a dozen units, Gen 2 around 300, Gen 3 will sell tens of thousands. Being blisteringly fast was worthless while AI was still a novelty, because nobody uses a novelty every day. The gap between "we built it" and "the market wants it" decides whether deep-tech companies live long enough to enjoy being right.
5. Bridging The Chasm. New compute architectures usually start with supercomputer customers who love speed and tolerate immature software. Cerebras ran the table at the National Labs, then won oil and gas and pharma, then sovereign G42 placed a $1 billion order. That capital let them rebuild the supply chain and battle-test at scale, because you cannot put $100 million of your own gear in a QA lab. By the time OpenAI and AWS showed up, the capacity was ready.
6. OpenAI In 4.5 Weeks. A $20 billion-plus deal went from "first conversation with Sam" to signed master agreement in 4.5 weeks. Term sheet the night before Thanksgiving, master agreement on Christmas Eve, working seven days a week with multiple law firms. Feldman's takeaway: many of the timelines he assumed were the speed of light in dealmaking were soft. Operators in this market are compressing M&A, financing, and data-center build-outs at the same time.
7. The Professional David. Cerebras is Feldman's fifth startup, and every one has been a David against a Goliath. He frames it as identity: if his mother could buy the product, he does not want to make it. The hard part is loving the underdog role for a decade, well past where most founders quit. If you do not love being a David, do not pick a fight with Nvidia.
8. $30K Of Tokens Per Engineer. Eight months ago Cerebras spent under $1,000 per engineer per year on inference tokens; today it is $25 to $30,000. The engineers who have figured it out run eight or ten agents around the clock, with their own QA agents to compensate for known coding-model weaknesses. They went from 10x to 100x. Most people, Feldman included, are still limping along trying to figure out the workflow.
9. The IPO Trade. Going public swaps technology-savvy venture investors for "my dad" in exchange for a slightly lower cost of capital and a much heavier compliance burden. The new wrinkle: three or four companies, OpenAI, Anthropic, Databricks, can now raise public-market sums in the private market. Everyone else still needs the IPO for legitimacy and the right to sell into public-company procurement. Cerebras had something no one else could offer: the only AI pure-play on the market, with 100% of revenue coming from this exact category.
10. The 1,000-Person Malaise. Companies between 1,000 and 3,000 people quietly stop taking the risks that built them. The culture shifts from "what extraordinary thing can we attempt" to "what can we ship in the next rev." Feldman's stated preference: rather fail in pursuit of the extraordinary than succeed in the ordinary. The corollary is recruiting discipline, because putting a butt in a seat to clear a req is death.
11. When To Quit. The honest moment to stop is when you laid out hypotheses for what it would take to win and every one came back negative. The trap is doing this sequentially, "let me test one more thing," and the slippery slope is a beast in business the way it is in ethics. The defense is other former CEOs who remember what you committed to a year ago and pull you back from the warm water before it boils. Articulate what has to change, put a time frame on it, and let someone hold you accountable.
12. Open Source As Oxygen. Open source kept AI research alive when closed-source frontier models were too expensive to use, and now it pressures the leading labs to stay ahead of techniques shipped out of Chinese projects. The result is an active ecosystem where other people's ideas do interesting things on your hardware. Feldman's filter: if you do not love watching other people's ideas take flight on what you built, the infrastructure business is not for you. The flame stays alive because someone keeps pushing the closed labs.