Siva

Siva

41 Photos and videos

Tweets

Pinned Tweet

Siva @ergodicthought

7 Feb 2025

1/ Let's unwrap why the notion of such an evaluation benchmark for AI models is irredeemably flawed, and how it promotes cargo cult mania...

Dan Hendrycks

@hendrycks

23 Jan 2025

We’re releasing Humanity’s Last Exam, a dataset with 3,000 questions developed with hundreds of subject matter experts to capture the human frontier of knowledge and reasoning. State-of-the-art AIs get <10% accuracy and are highly overconfident. @ai_risk @scaleai

14,829

Siva

Siva @ergodicthought

10h

Eh. Half the things are true, but his fundamental premises about the world are arguably wrong. 2026 is not 1996 or even 2016! If someone could not forecast a frontier model export control ban among the range of credible possibilities, their inputs are strategically dangerous.

Amit Ranjan

@amitranjan

14h

x.com/i/article/206679388052…

Siva

Siva @ergodicthought

17h

RT @prasannavishy: “I have never seen anyone move at this pace, let alone the government. I’m flabbergasted, honestly,” said the founder of…

India’s Rs 1 lakh crore R&D fund: speed was the signal, but Rs 5 lakh crore is the point

The Technology Development Board ran through its budget in weeks. Approved VCs are next. The harder ask, persuading one more stakeholder to respond to the big bet, falls to Saurabh Srivastava and...

the-ken.com

126

Siva

Siva @ergodicthought

Jun 15

Congrats to Sarvam on the raise. Excited to see @hcltech invest in a forward-thinking manner 🚀🚀

Sarvam

@SarvamAI

Jun 15

We're thrilled to announce that we have raised $234M in the first close of our $300M Series B at a $1.5B valuation. @HCLTech and @BessemerVP have joined us in this round, alongside continued support from @khoslaventures and @peakxvpartners For countries and companies, sovereign control on the AI stack is no longer an optionality. Sarvam will be the partner of choice for this aspiration. The capital allows us to accelerate our momentum towards this full stack of models, compute, and deployments. A huge thank you to our customers, partners, investors, and the Sarvam team for your trust and belief in what we are building. We’re just getting started. Read more: sarvam.ai/announcing-series-…

Siva

Siva @ergodicthought

Jun 15

Amadahl's law for human progress? 🤔

Matt Clancy @mattsclancy

Jun 15

Important post for metascience from @jddwor. He runs a simulation where AI accelerates some but not all fields, and progress requires combining discoveries across fields. He finds shifting resources into AI accelerated fields slows long-run progress. open.substack.com/pub/abunda…

955

Siva

Siva @ergodicthought

Jun 15

Pushing inference frontier (infra/algos) is just as imp as training best models. Any cracked BLR folks doing this? DeepSeek might not match Claude on benchmarks, but wins handily on intel per unit cost. Like any engg that's where the future lies. Esp if you believe in scaling.

Siva

Siva @ergodicthought

Jun 15

This post is about so much more than "hacking" "be curious lol" 🌟

nisarga

@ni5arga

Jun 14

everyone wants a hacking roadmap. the problem is that roadmaps don't create hackers. curiosity does. ctfs are changing, ai is everywhere, and the game looks very different now. how i'd start hacking in 2026: ni5arga.com/blog/posts/so-yo…

Jaya Gupta

Siva retweeted

Jaya Gupta

@JayaGup10

Jun 13

If you’re a software engineer worried about AI eating your job, become the person who can deploy, customize, evaluate, and operate *****open-source models***** inside companies. Organizations are finally optimizing for AI cost, privacy, and control and many will want this capability in-house.

859

53,759

sphinx

Siva retweeted

sphinx

@protosphinx

Jun 13

EVERYTHING IS BOM

sphinx

@protosphinx

16 Sep 2023

Open Source Manufacturing. Design global. Manufacture local. Starts with an open source BOM directory.

121

1,854

81,136

Siva

Siva @ergodicthought

Jun 14

$250M is easy if Indian IT industry dared to come together and pitch in to de-risk an existential crisis. Infosys alone bought back shares worth ~$200M in Nov 2025. Why are Indian model makers having to pitch global VCs? The market is in India, and the capital is too!

Hemant Mohapatra

@MohapatraHemant

Jun 13

To train a GPT class 1T model from scratch - including failed runs, data acq clean rlhf, post-training, team/people will likely req $250M of compute on an aggressive 3-4mo schedule (i.e. more reserved GPUs), $500-600M all-in IF you do a dense one. MoE fp8 will cut costs by 1/10th depending on how many active params you have. If you want SOTA however, the budgets go significantly higher on test-time compute, post-training RL, and data/synthetic generations..and v. high on talent. Maybe $2-4B all-in. After that comes serving the model. The talent is key to get to SOTA/beat it - and then you have to ensure this is useful enough to have inference vol over time - for which the capital will come if there is usage / TAM. So this is not as much about raising $50-60B, or raising it all at once as the OP says - we are investors in mistral, sarvam, reflection and anthropic - and they all scaled capital over time as models got adoption, but the early bottleneck is more on talent GPUs at that scale where you can do interesting things.

115

Siva

Siva @ergodicthought

Jun 14

It's important that the conversation be driven by *those who know how to do* rather than those who fear that everything is too hard and so oscillate between outrage & passivity.

@pHequals7

Jun 13

the FUD on training economics in India is borderline harmful You don’t need multi billion clusters.. american open source leader has trained their 400B model by raising ~$50M (they have since then raised a bigger round) we should celebrate competence not jingoism

pH

Siva retweeted

@pHequals7

Jun 13

Mark McQuade

@MarkMcQuade

Apr 1

Today we drop Trinity-Large-Thinking. SOTA on Tau2-Airline, frontier-class on Tau2-Telecom, and the #2 model on PinchBench, right behind Opus. On BCFLv4, we're in the mix with the best. 26 people with under $50M raised and a ruthless pursuit of greatness. What this team just pulled off is nothing short of incredible. One hell of an accomplishment and I couldn't be more proud of Arcee. And we've got more to prove.

5,182

Siva

Siva @ergodicthought

Jun 13

Why haven't Indian IT companies come together to create a consortium to invest in Indian coding AI models? Can do a great job here just by fine-tuning open weights models. Needs only a few million USD. Infosys spent 18,000 crore on stock buyback in Nov 2025

Mohandas Pai

@TVMohandasPai

Jun 13

PM @narendramodi Sir we need an India AI Mission under you with @NandanNilekani as vice chair and others from the private sector and govt. to Help India tackle the AI Revolution. We are way behind and need a national mission to get going quickly. Existing govt programs are too slow, way too small to make any large impact. We need an annual 50000 cr fund for deep tech and AI, a 200,000 cr ELGS Guarantee Fund to build Hyper cloud, hardware and chips. @AshwiniVaishnaw @nsitharaman @PiyushGoyal @FinMinIndia @RBI We need a Very Large National Mission. @AmitShah @amitmalviya

Siva

Siva @ergodicthought

Jun 13

RT @HarveenChadha: 2 labs are not enough for a country like India, we need more labs, more gpus, more research/infra engineers, more collab…

146

Siva

Siva @ergodicthought

Jun 13

RT @HarveenChadha: I wish I could do a podcast with people who were against building frontier models on why they didn't foresee this coming…

236

Siva

Siva @ergodicthought

Jun 13

Chips are not inf replicable bits. Multi-yr backlog of GPU orders; industry can't make enough It's not about having money to pay for chips; it's about owning productive capacity (having invested in infra several years before you needed chips) Just a childish tantrum otherwise

Neeraj Khandelwal

@neerajKh_

Jun 13

Today, I am not gonna sleep peacefully. The gap between two civilisations will accelerate to unimaginable levels if one has access to super intelligence and the other doesn’t. As a nation, why can’t we buy 200,000 chips like tomorrow and start training.

Siva

Siva @ergodicthought

Jun 13

Suppose you got chips. Then you need data centers. Then energy. Then AI talent to build sw infra to train models. Then huge quantities of highly curated data. Years of focused deep tech building; not just "app layer" and quick returns... and throwing a hissy fit one fine day

Soham Sankaran

Siva retweeted

Soham Sankaran

@sohamsankaran

Jun 13

As India realizes that we may be cut off from frontier AI, some demand nationalization of AI R&D with the architects of this mess in charge. Lunacy! We must overcome our technological cowardice via private enterprise with the free market as the judge. infinitesunrise.com/p/overco…

Overcoming India's technological cowardice

How can we learn to invent tomorrow?

infinitesunrise.com

1,879

Siva

Siva retweeted

Siva @ergodicthought

Apr 29

(circa ~1800) "India doesn't need to lead the world textile industry; it just needs to import cloth from Britain and ensure that the benefits of cheap cloth are widely shared. It's very lucrative to supply raw material (cotton) to the Lancashire mills."

Nandan Nilekani

@NandanNilekani

Apr 29

India doesn't need to lead the world in building the most advanced AI models. But it must lead in ensuring benefits of AI are widely shared. @rvenk and I have an op-ed in The @EconomicTimes economictimes.indiatimes.com…

495

Siva

Siva @ergodicthought

Jun 13

One cannot solve a problem with the same level of thinking that created it...

Mohandas Pai

@TVMohandasPai

Jun 13

339

Siva

Siva @ergodicthought

Jun 13

The fate of a community is shaped by the quality of its elites...

The Transcript

@TheTranscript_

Jun 12

Elon Musk in this 2012 interview: " My proceeds from PayPal after tax were about $180M, $100M of that went into SpaceX, $70M into Tesla, and $10M into SolarCity and I literally had to borrow money for rent." $SPCX $TSLA

3:43