Joined February 2025
104 Photos and videos
Pinned Tweet
Replying to @HedgieMarkets
you can't hedge a worldview

ALT Gunna Fire GIF

13
3,266
efe retweeted
When we announced @OpenAI o1 some researchers from other labs told me we made a strategic mistake and should have kept it secret so we could accelerate ourselves and pull farther ahead of the competition. Studies like these make me confident we made the right choice.
Together with researchers at Boston Children’s Hospital and Harvard, we published a study in NEJM AI showing how o3 Deep Research helped clinicians revisit previously unsolved rare pediatric disease cases, and find answers for families who had waited years.
14
29
497
24,035
efe retweeted
Announcing a new division of Midjourney called "Midjourney Medical"
1,764
3,278
27,394
8,611,116
efe retweeted

79
76
872
397,098
realizing karpathy was not the person we wanted him to be
Replying to @sporadica
very surprised that even karpathy has been really quiet, despite folks calling him out directly
47
efe retweeted
machines of selectively loving grace
6
70
678
15,883
efe retweeted

76
408
3,049
963,279
hell yeah
I have turned down multiple offers to be a pod bro at Shitadel, NoPoint72, & Millenial. Like why would I give you 80% of the profits for my ideas ?
1
70
efe retweeted
AFM Core Advanced on-device model running on A19 Pro is a sparse model. It's 20B parameters. It's fully Apple designed. It is an MoE but when it processes the prompt, it only loads the parameters needed and locks them in. If it's 20B parameters total, but on a specific request it's only 1-4B parameter total. It only loads in 1-4B for inference and decides them at prefill time. It is fully Apple designed architecture, Google had nothing here.
57
303
2,092
1,421,972
Anthropics way of calculating ARR would be a pain the ass in the second half of this year good luck with that IPO
“Workloads” account for less than 20% of Anthropic OAI revenue and they are rapidly declining.
1
1
119
how much money does labubu have
Replying to @david88lee
I lost $1M yesterday , it’s great and I can pay less short term capital gain tax😄
42
GOAT
Stanley Druckenmiller averaged 30% a year for 30 years without a single losing year Paul Tudor Jones asked him on stage what made him different from everyone else - his answer: size "I put 350% long into one bond trade - 200 to 300% of my fund into a single currency - put all your eggs in one basket and watch the basket carefully" "I've never used a stop loss in 40 years - I exit when the reason I bought changes, not when the price is down" "the world changed on 9/11, the world changed when the wall came down, the world changed election night - these moments set in place two to four-year trends you can play" bookmark and watch it today ↓
40
efe retweeted
1.1. Question 28: What happened 1973 and 1974 when your investment firm lost over half? Charlie: Oh, that’s very simple. That’s very easy. That’s a good lesson. That’s a good question. What happened is the value of my partnership where I was running, went down by 50% in one year. Now the market went down by 40% or something. It was a once in 30 year recession. I mean monopoly newspapers are selling at 3 or 4 times earnings. At the bottom tick, I was down from the peak, 50%. You’re right about that. That has happened to me 3 times in my Berkshire stock. so I regard it as part of manhood. If you’re going to be in this game for the long pull, which is the way to do it, you better be able to handle a 50% decline without fussing too much about it. And so my lesson to all of you is conduct your life so that you can handle the 50% decline with aplomb and grace. Don’t try to avoid it. (applause) It will come. In fact I would say if it doesn’t come, you’re not being aggressive enough. 1.2. “I regard it as a part of manhood. If you’re going to be in this game for the long haul which is the way to do it. You better be able to handle a 50% decline without fussing too much. Conduct your life so you can handle a 50% decline with aplomb and grace. Don’t try to avoid it. It will come. And if it doesn’t come I’d say your not being aggressive enough”.
12
163
1,154
227,970
efe retweeted
Remember when everyone hated/sold $META off 80% because their capex was like 30bn & 50% of CFO...good times
2
1
116
22,234
just fix the siri
time for a new concept, what if Apple made a Fitbit Air competitor? Imagine an "Apple Loop" inspired by the sport loop with a small aluminum puck and a MagSafe style charger...
70
efe retweeted
A video showing what appears to be a naval mine purported to be in Omani waters in the Strait of Hormuz has begun circulating on various social media platforms, following the announcement yesterday from Oman's Ministry of Defense warning mariners of a suspected naval mine in their waters. While the authenticity of this video can not be verified, the mine appears to be most similar to an Iranian Maham-1 moored contact mine.
The Ministry of Defense of Oman issued an alert earlier today following the sighting of a suspected naval mine within the territorial waters of Oman in the Strait of Hormuz advising all mariners to keep a safe distance from suspicious objects and report them to authorities.
41
129
1,090
507,081
sorry but this is fucking ridiculous what do you mean they are using firefighter hoses for a computer
39
efe retweeted

2
13
117
13,168
AGI if this saturates
CumBench v1.0 results are in. Gemini 3.5 Flash ranks #1 on the CumBench benchmark, outperforming much larger models a whole size above it in real-world finish quality. The gap is honestly staggering.
68
this
Replying to @GaryMarcus
It may turn out to be true that other approaches are more effective. But that doesn’t invalidate the progress made by current approaches, right? Surely the exponential adoption rates indicate that current approaches are at least *part* of the ballgame?
57
efe retweeted
There's an arXiv paper from 2 weeks ago that the finops community hasn't absorbed yet. The authors ran identical agentic tasks. Same model. Same prompt. Same context window. Same tool stack. They measured end-to-end token consumption across many runs. The coefficient of variation was extreme. The same task could cost 8 dollars or 240 dollars, 30 times apart, with no change to any input. This is not a bug. It is a feature of agentic execution. The model decides how many tool calls to make. The retrieval layer decides how much context to pull. The verifier decides whether to loop again. Each of these is a stochastic decision conditioned on intermediate outputs the user never sees. Most enterprise AI procurement assumes the opposite. Cloud compute has tight variance around its mean. You can size a budget. You can quote a fixed price. You can sign an MSA on a per-million-token rate card. You can still do that with agents, but not with the old mental model. The piece of this that nobody is pricing yet is what happens when the procurement function at JP Morgan or BCG or Pfizer realizes the math. They will not just ask for cheaper tokens. They will ask for tighter controls on variance: better routing, better caching, better observability, better policy limits, and in some cases dedicated capacity. That is a very different infrastructure requirement from stateless inference. It does not mean cloud disappears. It means the winning cloud looks much more like an execution and control layer for stochastic workloads. The unit of AI cost is not the token, it is the distribution. That is also why I think platforms like @nebiustf and others matter.
The token economy just had its biggest day. Google I/O this morning: Sundar Pichai opened not with a model drop but a token counter. 9.7T/month in 2024. 480T in 2025. 3.2 quadrillion today. so around 330x in two years. and to add on to that: top GCP customers are each processing around 1T tokens per day (1 company, 1day😭) On the same afternoon, OpenAI quietly launched “Guaranteed Capacity” (1-3 year compute contracts for enterprises). @sama own words: “the world will be capacity-constrained for some time.” Two of the most powerful AI companies on earth just told you, on the same day, that demand is outrunning supply and it’s not slowing down. This is Jevons Paradox in real time. Cheaper, faster models don’t reduce token consumption, they explode it. Wrote about the mechanics of this a few weeks ago
13
17
234
46,637