Joined September 2022
381 Photos and videos
Does anyone here currently go to UCSC, or have a sibling/kid there? My sister is starting there next year, and I’d love to connect them so she can ask a few questions :)
6
1
9
3,336
Personal update:
9
1
90
58,963
I have not joined Anthropic
2
41
5,443
Over a month since I've been on X. What did I miss
3
8
4,339
Aarush Sah retweeted
Following-up: Muse Spark is 100% legit. It handles my general queries as well as, if not better than, other frontier models. And echoing @borrowed_ideas: 'contemplating mode' is a true leading-edge feature. Congrats to @alexandr_wang and the entire MSL team! Can't wait for more.
I tested $META's Muse Spark over the last few hours and came away net positive. 3 main takeaways: 1) Quality: It's a very good model. Not quite frontier but good. It showed comparable performance vs Opus 4.6 across web data search, PDF parsing, and general knowledge/conversation/writing. It's worse at coding. Both models solved an easy coding task, but Muse Spark failed the hard one while Opus one-shotted. The image-gen is also worse than ChatGPT. But all in, it's a legitimate and usable general model. Lots of room to develop the UI further (ie it should show a map when recommending local restaurants) but the underlying model itself is impressive. 2) Speed: Notably, Muse Spark answered almost instantly while Opus 4.6 felt borderline unusable at times. I'm a huge Anthropic fan but latency has become a major issue. Simple answers take too long and multi-step agent flows break more often. Meta seems to have more available compute which is a real factor going forward. 3) Scaling: Meta hasn't published a full model card so we're working with limited disclosure. But the graphs below might be the most important part of the release. Their rearchitected pretraining stack shows a near-linear relationship between RL compute and accuracy. If that holds, Meta has a clear path to training much larger, more intelligent models. That's arguably more consequential than Muse Spark itself. All in, it's positive. Muse Spark is a good, usable model, it's being served smoothly, and Meta looks to be on an encouraging trajectory.
1
12
118
22,220
Very excited for you all to see Muse Spark! It’s a capable model that excels in multimodal and agentic tool use tasks. Spark is the first (of many) steps towards Personal Superintelligence, and I look forward to seeing how our models evolve as we get closer to that goal :)
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧡
2
24
1,902
Wow. Claude Mythos represents at 20% jump on SWE-bench Pro over the next best model, GPT-5.4 Congrats to the @AnthropicAI team!
Replying to @AnthropicAI
The Claude Mythos Preview system card is available here: anthropic.com/claude-mythos-…
2
1
10
1,299
The scale of a frontier lab's operations is humbling to experience firsthand
2
36
5,733
Sometimes it’s easy to forget that LLMs are marvels of engineering. Like what do you mean we have a machine that can actually understand the meaning behind a bunch of characters, the same way a human can?!
17
1,360
Amazing work from @JonathanRoss321, @sundeep, @GavinSherry and the team. Very excited to see this work come to fruition πŸ’šπŸ€πŸ§‘
On the right: Vera Rubin. Middle: NVLink 6th Gen Left: The brand new Groq system
38
2,571
Aarush Sah retweeted
LPU in Ian Buck's hand, sitting in the audience at GTC
9
22
322
17,691
Went to an Apple store yesterday and they're still sold out of mac minis πŸ’€
3
8
1,417
Aarush Sah retweeted
GPU β™₯ LPU: Everything You Wanted to Know I’m joining David Senra (@FoundersPodcast) at @NVIDIAGTC for a conversation about the reality of modern inference. This is your opportunity to learn why Nvidia and Groq partnered together, and what it means for the future of inference.
17
41
443
69,206
Glad they released this one in the API too - would love to see how it performs with @droid or other coding agents πŸ‘€
Mar 5
GPT-5.4 Thinking and GPT-5.4 Pro are rolling out now in ChatGPT. GPT-5.4 is also now available in the API and Codex. GPT-5.4 brings our advances in reasoning, coding, and agentic workflows into one frontier model.
1
13
1,569
Claude Code fast mode is amazing - such a QoL increase Makes me wonder how Claude Code would feel running on NVIDA/Groq LPUs or Cerebras wafers πŸ‘€
1
1
21
2,406
Personal Superintelligence
2
2
43
4,227
After a fruitful time at Groq and NVIDIA, I’ve joined @Meta Superintelligence Labs. Very excited to continue the march towards ASI with such a talent-dense team 🫑πŸ”₯
66
7
481
31,160
πŸ’™
1
9
1,629
πŸ§‘πŸ€πŸ’š
8
1,386
YOU CAN JUST BUILD THINGS
23
6
96
6,187

Feb 9
You can just build things.
1
2,063