In light of DeepSeek acknowledging the need to scale pre-training in their most recent paper, it's fun to re-visit some of the bad takes from the 'DeepSeek moment' at the start of the year:
"DeepSeek now also undermines the business case for scaling. OpenAI has been burning through staggering sums of cash to keep up its scaling paradigm and has yet to figure out how to balance its checkbooks - and it turns out it didnโt need to spend so much cash." - Karen Hao, The Atlantic (Jan 27, 2025)
"If scale is the answer, what is the question? How DeepSeek exposed a fundamental AI scaling myth... Was the Tech CEO narrative about scale simply self-serving all along, and at its core, little more than a money grab?" โ Daniel Akarca, IAI News (Jan 29, 2025)
"[DeepSeek-R1] overturns the accepted wisdom that scaling is the way forward... It suggests that US companies are throwing money away and can be beaten by more nimble competitors."โ James Vincent, The Guardian (Jan 28, 2025)
"DeepSeekโs resource-efficient methods could force a reconsideration of brute-force AI strategies... we could very well see demand for AI Computing power cool off." โ Trefis Team, Nasdaq (Jan 27, 2025)
"Given the scaling up of pre-training compute also stalled, we'll see less AI progress via compute scaling... and more of it will come from inference scaling."โ Effective Altruism Forum / Alignment Research Sentiment (Jan/Feb 2025)
"Investors are questioning whether the AI narrative is permanently changed... [DeepSeek] highlight that the rapid growth of AI computing requirements is slowing."โ Invesco Investment Insights (March 2025)