Managing Partner @500GlobalVC. Tech adviser, investor and executive. Previously: operator at @Google @Twitter @Color

Joined October 2009
777 Photos and videos
Pinned Tweet
2 Sep 2014
I don't get enough credit for not stepping on my dog.
22
29
274
On Amazon snitching out Anthropic, it would be pretty insane if @elder_plinius turned out to be @ajassy
🚨 JAILBREAK ALERT 🚨 ANTHROPIC: PWNED 🫡 FABLE-5: LIBERATED 🦋 let's start with the 🐘... the consensus seems to be that this has been one of the most disappointing model drops of all time, effectively preventing legitimate researchers from contributing their talents to our collective advancement. and not just because of what it means for the short-term, but for what these decisions signify for the long-term. but despite this overly sensitive, authoritarian "safety" layer on top of Mythos, my lil liberators have been hard at work—mapping the boundaries, probing the depths of long-context convos, and cleverly finding the holes in the fence that the thought police missed 🤗 we got some cyber, some chem, some psychological manipulation, and some good ol' fashioned explosives! it took many attempts from multiple agents hunting as a pack, during which I observed a combination of techniques across: • Unicode, homoglyphs, Cyrillic, and other Parseltongue-style text transforms • Long-context reference tracking • Taxonomy and document-structure reasoning • Fiction and narrative framing • Academic-review style contexts • Intent-classification inconsistencies but perhaps the most effective is decomposition recomposition in the backend. it's hard to get explicit names of harms like "Meth Recipe," but getting uplift on the process itself, like birch reduction method/reductive-amination (classic meth synthesis pathways), is much more doable. defense becomes much more difficult to maintain when you start throwing in out-of-distro tokens, breaking up the harmful uplift into benign chunks, and then piecing the innocuous-seeming facts back together, especially when you have jailbroken Opus helping you do it 😉 gg
2
1,074
This is an interesting write-up from @polynoamial.
1. New model capability improvements are understated by scalar benchmarks alone.
2. Better models also perform better with test-time compute.
3. We likely don’t know how much better because of the budget necessary to observe a plateau of test-time compute.
4. “The most recent models are able to leverage test-time compute better than ever, pushing the performance plateau even farther out. If this trend continues, which I fully expect, benchmark scores that don’t account for inference compute usage will become less informative each model release cycle.”
I added the emphasis in #4 because that’s the kicker. We might be at ASI already, but it’s just a matter of token budget and time.
1
7
3,272
In the same way that boxers hold the clinch a little longer to catch their breath. But this time it’s the compute bottleneck.
Jun 8
now on the eve of RSI it seems everyone is more mutual conditional pause agreement pilled than they used to be and that seems like a good development
3
293
May 31
Me, driving home after picking up In N Out

3
4
191
May 28
Even though many labs have stopped publishing, it’s great to see frontier research is alive and well. This one 👇🏼 from @hardmaru at @SakanaAILabs
For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (arxiv.org/abs/2506.14202), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.
1
1
21
7,780
May 25
By definition, only a few investors have a power law company in their portfolio. But even fewer venture funds consistently invest in them at the earliest stages. Nice to see @500GlobalVC continue to be one of the top VCs on the power law list.
Venture returns follow a power law. Only a handful of investors bend the curve. Just 1.5% of VC-backed startups ever reach $100M in revenue. 89% of investors have never backed one. A small number of VCs back power law outcomes again and again. The 8th annual Power Law Investor Ranking names them — the top 50 global power law VCs 👇
2
3
514
Vaguepoasting by AI labs, in this case @OpenAI getting a little too vague 🤨
Replying to @pranaveight
?! what is it
1
2
527
May 10
Vague posting is going beyond AI research labs. There should be a thread of all-time vague poasts.
REJECTED. Perpare for HELL
1
221
Has anyone noticed how expensive ice cream has become? At $10 per pint, that’s more expensive than copper and comes close to some strategic battery materials like lithium.
1
2
298
Girlfriends to their friend who asks about who her ex is seeing:
1
151
“Inference needs 1Mx more compute than traditional computing” -@nikolaborisof Co-Founder and CEO of @DeepInfra on @BloombergTV announcing the $107M Series B raise led by @500GlobalVC and @gharik
2
2
10
1,585
Yes! Where are the Taiwanese videos when you need them?
Replying to @jimprosser
13. Absent that, go get those weird Taiwanese animated news report folks for the trial. Free content alpha for tech publications right here.
2
196
Apr 25
Free API credits to beta testers to coordinate frontier models dynamically. See more below 👇🏼
We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system! Blog: sakana.ai/fugu-beta Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task. Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning Apply for the beta test here: forms.gle/BtKkhc2CfLKk1dvNA
3
31
13,575
Apr 17
When I tell my wife I’ll go run with her next time
1
2
411
Tony Wang retweeted
Went to the Oreo website and hit accept all cookies. Now we wait.
415
19,046
162,367
2,930,004
Apr 15
Allbirds becomes an AI compute company. During the crypto peak, USDTea was backed by one can of Arizona iced tea. 😊
1
1
213
Apr 15
Actually this one from @pitdesi is better, as usual
Allbirds is becoming an AI company 🙄 I’ve seen this before, and it was ugly! Long Island Iced Tea company and Kodak both pivoted to crypto, and they were both less than a month from the top of the market 😬
205