Aarush Sah

Aarush Sah

381 Photos and videos

Tweets

Aarush Sah

@aarush

May 22

Does anyone here currently go to UCSC, or have a sibling/kid there? My sister is starting there next year, and I’d love to connect them so she can ask a few questions :)

3,336

Aarush Sah

Aarush Sah

@aarush

May 20

Personal update:

58,963

Aarush Sah

Aarush Sah

@aarush

May 20

I have not joined Anthropic

5,443

Aarush Sah

Aarush Sah

@aarush

May 18

Over a month since I've been on X. What did I miss

4,339

Evergreen Capital

Aarush Sah retweeted

Evergreen Capital

@evergreencap3

Apr 13

Following-up: Muse Spark is 100% legit. It handles my general queries as well as, if not better than, other frontier models. And echoing @borrowed_ideas: 'contemplating mode' is a true leading-edge feature. Congrats to @alexandr_wang and the entire MSL team! Can't wait for more.

Evergreen Capital

@evergreencap3

Apr 9

I tested $META's Muse Spark over the last few hours and came away net positive. 3 main takeaways: 1) Quality: It's a very good model. Not quite frontier but good. It showed comparable performance vs Opus 4.6 across web data search, PDF parsing, and general knowledge/conversation/writing. It's worse at coding. Both models solved an easy coding task, but Muse Spark failed the hard one while Opus one-shotted. The image-gen is also worse than ChatGPT. But all in, it's a legitimate and usable general model. Lots of room to develop the UI further (ie it should show a map when recommending local restaurants) but the underlying model itself is impressive. 2) Speed: Notably, Muse Spark answered almost instantly while Opus 4.6 felt borderline unusable at times. I'm a huge Anthropic fan but latency has become a major issue. Simple answers take too long and multi-step agent flows break more often. Meta seems to have more available compute which is a real factor going forward. 3) Scaling: Meta hasn't published a full model card so we're working with limited disclosure. But the graphs below might be the most important part of the release. Their rearchitected pretraining stack shows a near-linear relationship between RL compute and accuracy. If that holds, Meta has a clear path to training much larger, more intelligent models. That's arguably more consequential than Muse Spark itself. All in, it's positive. Muse Spark is a good, usable model, it's being served smoothly, and Meta looks to be on an encouraging trajectory.

118

22,220

Aarush Sah

Aarush Sah

@aarush

Apr 8

Very excited for you all to see Muse Spark! It’s a capable model that excels in multimodal and agentic tool use tasks. Spark is the first (of many) steps towards Personal Superintelligence, and I look forward to seeing how our models evolve as we get closer to that goal :)

Alexandr Wang

@alexandr_wang

Apr 8

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

1,902

Aarush Sah

Aarush Sah

@aarush

Apr 7

Wow. Claude Mythos represents at 20% jump on SWE-bench Pro over the next best model, GPT-5.4 Congrats to the @AnthropicAI team!

Anthropic

@AnthropicAI

Apr 7

Replying to @AnthropicAI

The Claude Mythos Preview system card is available here: anthropic.com/claude-mythos-…

1,299

Aarush Sah

Aarush Sah

@aarush

Mar 23

The scale of a frontier lab's operations is humbling to experience firsthand

5,733

Aarush Sah

Aarush Sah

@aarush

Mar 18

Sometimes it’s easy to forget that LLMs are marvels of engineering. Like what do you mean we have a machine that can actually understand the meaning behind a bunch of characters, the same way a human can?!

1,360

Aarush Sah

Aarush Sah

@aarush

Mar 16

Amazing work from @JonathanRoss321, @sundeep, @GavinSherry and the team. Very excited to see this work come to fruition 💚🤝🧡

Ryan Shrout

@ryanshrout

Mar 16

On the right: Vera Rubin. Middle: NVLink 6th Gen Left: The brand new Groq system

2,571

Jonathan Ross

Aarush Sah retweeted

Jonathan Ross

@JonathanRoss321

Mar 16

LPU in Ian Buck's hand, sitting in the audience at GTC

322

17,691

Aarush Sah

Aarush Sah

@aarush

Mar 16

Went to an Apple store yesterday and they're still sold out of mac minis 💀

1,417

Jonathan Ross

Aarush Sah retweeted

Jonathan Ross

@JonathanRoss321

Mar 13

GPU ♥ LPU: Everything You Wanted to Know I’m joining David Senra (@FoundersPodcast) at @NVIDIAGTC for a conversation about the reality of modern inference. This is your opportunity to learn why Nvidia and Groq partnered together, and what it means for the future of inference.

443

69,206

Aarush Sah

Aarush Sah

@aarush

Mar 5

Glad they released this one in the API too - would love to see how it performs with @droid or other coding agents 👀

OpenAI

@OpenAI

Mar 5

GPT-5.4 Thinking and GPT-5.4 Pro are rolling out now in ChatGPT. GPT-5.4 is also now available in the API and Codex. GPT-5.4 brings our advances in reasoning, coding, and agentic workflows into one frontier model.

1,569

Aarush Sah

Aarush Sah

@aarush

Mar 4

Claude Code fast mode is amazing - such a QoL increase Makes me wonder how Claude Code would feel running on NVIDA/Groq LPUs or Cerebras wafers 👀

2,406

Aarush Sah

Aarush Sah

@aarush

Feb 21

Personal Superintelligence

4,227

Aarush Sah

Aarush Sah

@aarush

Feb 17

After a fruitful time at Groq and NVIDIA, I’ve joined @Meta Superintelligence Labs. Very excited to continue the march towards ASI with such a talent-dense team 🫡🔥

481

31,160

Aarush Sah

Aarush Sah

@aarush

Feb 17

💙

1,629

Aarush Sah

Aarush Sah

@aarush

Feb 13

🧡🤝💚

1,386

Aarush Sah

Aarush Sah

@aarush

Feb 9

YOU CAN JUST BUILD THINGS

6,187

Aarush Sah

Aarush Sah

@aarush

Feb 9

x.com/openai/status/20206497…

OpenAI

@OpenAI

Feb 9

You can just build things.

1:00

2,063